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Abstract 

This paper investigates the power of polynomial-time quantum computation in which only a very limited 
number of qubits are initially clean in the |0) state, and all the remaining qubits are initially in the totally mixed 
state. No initializations of qubits are allowed during the computation, nor intermediate measurements. The 
main results of this paper are unexpectedly strong error-reducible properties of such quantum computations. 
It is proved that any problem solvable by a polynomial-time quantum computation with one-sided bounded 
error that uses logarithmically many clean qubits can also be solvable with exponentially small one-sided error 
using just two clean qubits, and with polynomially small one-sided error using just one clean qubit (which 
in particular implies the solvability with any small constant one-sided error). It is further proved in the case 
of two-sided bounded error that any problem solvable by such a computation with a constant gap between 
completeness and soundness using logarithmically many clean qubits can also be solvable with exponentially 
small two-sided error using just two clean qubits. If only one clean qubit is available, the problem is again 
still solvable with exponentially small error in one of the completeness and soundness and polynomially small 
error in the other. As an immediate consequence of the above result for the two-sided-error case, it follows that 
the Trace Estimation problem defined With fixed constant threshold parameters is complete for BQi^gP and 
the classes of problems solvable by polynomial-time quantum computations with completeness 2/3 and 
soundness 1 /3 using logarithmically many clean qubits and just one clean qubit, respectively. The techniques 
used for proving the error-reduction results may be of independent interest in themselves, and one of the technical 
tools can also be used to show the hardness of weak classical simulations of one-clean-qubit computations (i.e., 
DQCl computations). 


1 Introduction 


1.1 Background 

An inherent nature of randomized and quantum computing is that the outcome of a computation is probabilistic and 
may not always be correct. Error reduction, or success-probability amplification, is thus one of the most fundamen¬ 
tal issues in randomized and quantum computing. Computation error can be efficiently reduced to be negligibly 
small in many standard computation models via a simple repetition followed by an OR-type, or an AND-type, or 
a threshold-value decision, depending on whether the error can happen in the original computation only for yes- 
instances, or only for no-instances, or for both. Typical examples are polynomial-time randomized and quantum 
computations with bounded error, and in particular, the error can be made exponentially small in BPP and BQP 
both in completeness and in soundness, which provides a reasonable ground for the well-used definitions of BPP 
and BQP that employ bounds 2/3 and 1/3 for completeness and soundness, respectively. In many other compu¬ 
tation models, however, it is unclear whether the error can be reduced efficiently by the simple repetition-based 
method mentioned above, and more generally, whether error reduction itself is possible. Such a situation often 
occurs when a computation model involves communications with some untrusted parties, like interactive proof sys¬ 
tems. For instance, the simple repetition-based method does work for quantum interactive proofs in the one-sided 
error case of perfect completeness, but its proof is highly nontrivial IIKW001lGut09l . Moreover, a negative evidence 
is known in the two-sided error case that the error may not be reduced efficiently via the above-mentioned simple 
method of repetition with threshold-value decision IIMW121I . although error reduction itself is anyway possible in 
this case, as any two-sided-error quantum interactive proof system can be made to have perfect completeness by 
adding more communication turns HKWOOl IKLGN151 (and the number of communication turns then can be re¬ 
duced to three). Another situation where error reduction becomes nontrivial (and sometimes impossible) appears 
when a computation model can use only very limited computational resources, like space-bounded computations. 
If the resources are too limited, it is simply impossible to repeat the original computation sufficiently many times, 
which becomes a quite enormous obstacle to error reduction in the case of space-bounded quantum computations 
when initializations of qubits are disallowed after the computation starts. Indeed, it is known impossible in the case 
of one-way quantum finite state automata to reduce computation error smaller than some constant IIAF98I . Also, it 
is unclear whether error reduction is possible or not in various logarithmic-space quantum computations. For com¬ 
putations of one-sided bounded error performed by logarithmic-space quantum Turing machines, Watrous HWatOllI 
presented a nontrivial method that reduces the error to be exponentially small. Other than this result, error-reduction 
techniques have not been developed much for space-bounded quantum computations. 

Another well-studied model of quantum computing with limited computational resources is the deterministic 
quantum computation with one quantum bit (DQCl), often mentioned as the one-clean-qubit model, which was 
introduced by Knill and Faflamme IIKF98L originally motivated by nuclear magnetic resonance (NMR) quantum 
information processing. A DQCl computation over w qubits starts with the initial state of the totally mixed state 
except for a single clean qubit, namely, |0)(0| (8) After applying a polynomial-size unitary quantum 

circuit to this state, only a single output qubit is measured in the computational basis at the end of the computa¬ 
tion in order to read out the computation result. The DQCl model may be viewed as a variant of space-bounded 
quantum computations in a sense, and is believed not to have full computational power of the standard polynomial¬ 
time quantum computation. Indeed, it is known strictly less powerful than the standard polynomial-time quantum 
computation under some reasonable assumptions IIASV061I . Moreover, since any quantum computation over the 
totally mixed state is trivially simulatable by a classical computation, the DQCl model looks easy to clas¬ 

sically simulate at first glance. Surprisingly, however, the model turned out to be able to efficiently solve several 
problems for which no efficient classical algorithms are known, such as calculations of the spectral density IKF98II . 
an integrability tester ||PFMP03i . the fidelity decay IIPBKFO041 . Jones and HOMFFY polynomials IISJ081IJW091 . 
and an invariant of 3-manifolds IIJA141I . More precisely, DQCl computations can solve the decisional versions of 
these problems with two-sided bounded error. Computation error can be quite large in such computations, and in 
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fact, the gap between completeness and soundness is allowed to be polynomially small. The only method known 
for amplifying success probability of these computations is to sequentially repeat an attempt of the computation 
polynomially many times, but this requires the clean qubit to be initialized every time after finishing one attempt, 
and moreover, the result of each attempt must be recorded to classical work space prepared outside of the DQCl 
model. It is definitely more desirable if computation error can be reduced without such initializations. The situ¬ 
ation is similar even when the number of clean qubits is allowed to be logarithmically many with respect to the 
input length. It is also known that any quantum computation of two-sided bounded error that uses logarithmically 
many clean qubits can be simulated by a quantum computation still of two-sided bounded error that uses just one 
clean qubit, but the known method for this simulation considerably increases the computational error, and the gap 
between completeness and soundness becomes polynomially small. 

1.2 Main Results 

This paper develops methods of reducing computation error in quantum computations with few clean qubits, includ¬ 
ing the DQCl model. As will be presented below, the methods proposed in this paper are unexpectedly powerful 
and provide an almost fully satisfying solution in the cases of one-sided bounded error, both to the reducibility of 
computation error and to the reducibility of the number of clean qubits. In the case of two-sided bounded error, the 
methods in this paper are applicable only when there is a constant gap between completeness and soundness in the 
original computation, but still significantly improve the situation of quantum computations with few clean qubits. 

Let QiogP(c, s), Q[i]P(c, s), and Q[2]P(c, s) denote the class of problems solvable by a polynomial-time quan¬ 
tum computation with completeness c and soundness s that uses logarithmically many clean qubits, one clean 
qubit, and two clean qubits, respectively. The rigorous definitions of these complexity classes will be found in 
Subsection [ 33 ] 

First, in the case of one-sided bounded error, it is proved that any problem solvable by a polynomial-time quan¬ 
tum computation with one-sided bounded error that uses logarithmically many clean qubits can also be solvable by 
that with exponentially small one-sided error using just two clean qubits. 

Theorem 1. For any polynomially bounded function p: Z"*" —)• N and any polynomial-time computable func¬ 
tion S'. Z"*" [0,1] satisfying 1 — s > ^ for some polynomially bounded function q: Z"*" — )• N, 

QiogP(l,5) CQ[2]P(l,2-n. 


If only one clean qubit is available, the problem is still solvable with polynomially small one-sided error (which 
in particular implies the solvability with any small constant one-sided error). 

Theorem 2. For any polynomially bounded function p: Z"*“ —>• N and any polynomial-time computable func¬ 
tion s: Z"*" [0,1] satisfying 1 — s > ^ for some polynomially bounded function q: Z"*" —N, 

QiogP(l,s) C Q[i]P(^l, 

The above two theorems are for the one-sided error case of perfect completeness, and similar statements hold 
even for the case of perfect soundness, by considering the complement of the problem. 

Corollary 3. For any polynomially bounded function p: Z"*" —?■ N and any polynomial-time computable func¬ 
tion c: Z+ —?■ [0,1] satisfying c > ^ for some polynomially bounded function q: Z^ —)• N, the following two prop¬ 
erties hold: 

(i) QiogP(c,0) CQf2 ]P(l-2-P,0), 

(ii) QiogP(c,0)CQ[i]P(l-i 0). 
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In the case of two-sided bounded error, similar statements are proved on the condition that there is a constant gap 
between completeness and soundness in the original computation. Namely, it is proved that any problem solvable 
by a polynomial-time quantum computation that uses logarithmically many clean qubits and has a constant gap 
between completeness and soundness can also be solvable by that with exponentially small two-sided error using 
just two clean qubits. 

Theorem 4. For any polynomially bounded function p : —)• N and any constants c and s in M satisfying 

0 < s < c<l, 

QiogP(c,s)CQ[2]P(l-2-P,2-P). 


If only one clean qubit is available, the problem is again still solvable with exponentially small error in one of 
the completeness and soundness and polynomially small error in the other. 

Theorem 5. For any polynomially bounded function p: TLf' —)• N and any constants c and s in M satisfying 
0 < s < c < 1, 

Ql„gP(c,») c Q|i|P(l - 2-', i) n Q„|P(l - f 2-f). 

The ideas for the proofs of these statements will be overviewed in Section |2] The techniques developed in 
the proofs may be of independent interest in themselves, and one of the technical tools can be used to show the 
hardness of weak classical simulations of DQCl computations, as will be summarized below. 

1.3 Further Results 

Completeness results for Trace Estimation problem Define the complexity classes BQjggP and BQ[^]P 
by BQiQgP = QiogP(|, 1) and BQjj^jP = Q[i]P(|, |), respectively. An immediate consequence of Theorem[5] 
is that the Trace Estimation problem is complete for BQj^gP and BQj^jP under polynomial-time many-one 
reduction, even when the problem is defined with fixed consfanf paramefers thaf specify fhe bounds on normalized 
fraces in fhe yes-insfance and no-insfance cases. 

Given a description of a quanfum circuif fhaf specifies a unifary fransformafion U, fhe Trace Estimation 
problem specified wifh fwo paramefers a and b safisfying —l<5<a<lis fhe problem of deciding whefher fhe 
real part of fhe normalized frace of U is af leasf a or if is af mosf b. 


Trace Estimation Problem: TREsT(a, b) 


Input: 

Yes Instances: 
No Instances: 


A descripfion of a quanfum circuif Q fhaf implemenfs a unifary fransformafion U over n qubifs. 
2 ^3?(tr U) > a. 

^3?(tr U) < b. 


The paper by Knill and Eaflamme IIKL98II fhaf infroduced fhe DQCl model already pointed ouf fhaf Ibis prob¬ 
lem is closely relafed fo fhe DQCl compulation. This poinf was furlher clarified in fhe succeeding lileralure (see 
Refs. IIShe06[[She091[SJ08]| . for inslance). More precisely, consider a varianl of fhe Trace Estimation problem 
where fhe fwo paramefers a and b may depend on fhe inpul lenglh (i.e., fhe lenglh of fhe description of Q). If is 
known fhaf Ibis version of fhe Trace Estimation problem, for any a and b such fhaf fhe gap a — b is bounded 
from below by an inverse-polynomial wifh respecl fo fhe inpuf lenglh, can be solved by a DQCl compulalion wifh 
some Iwo-sided bounded error where fhe completeness and soundness paramefers c and s depend on a and b. If 
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is also known that, for any two nonnegative parameters a and b such that the gap a — 5 is bounded from below 
by an inverse-polynomial with respect to the input length, the corresponding version of the Trace Estimation 
problem is hard for the complexity class Q[i]P(c, s) for some completeness parameter c and soundness parameter s 
that depend on a and b. Hence, the Trace Estimation problem essentially characterizes the power of the DQCl 
computation, except the following subtle point. One thing to be pointed out in the existing arguments above is that, 
when the parameters a and b are fixed for the Trace Estimation problem, the completeness c and soundness s 
with which the problem is in Q[]^]P(c, s) are different from the completeness c' and soundness s' with which the 
problem is hard for Q[i]P(c', s'). Namely, given two nonnegative parameters a and b of the problem, the computa¬ 
tion solves the problem with completeness c = (1 -|- a)/2 and soundness s = (1 -|- 6)/2, while the problem is hard 
for the class with completeness c' = a/4 and soundness s' = 6/4. Therefore, the existing arguments are slightly 
short for proving BQjj^jP-completeness of the Trace Estimation problem with fixed parameters a and 6 (and 
Q[i]P(c, s)-completeness of that for fixed completeness and soundness parameters c and s, in general). 

In contrast, with Theorem[5]in hand, it is immediate to show that the Trace Estimation problem is complete 
for BQjQgP and for BQj^jP for any constants a and 6 satisfying 0 < 6 < a < 1. 

Theorem 6. For any constants a and b in M satisfying 0 < 6 < o < 1, TREST(a, 6) is complete for BQ^^gP and 
for BQj^jP under polynomial-time many-one reduction. 


Hardness of weak classical simulations of DQCl computation Recently, quite a few number of studies focused 
on the hardness of weak classical simulations of restricted models of quantum computing under some reasonable 
assumptions IITD041 IBJSlll IAA131 INVdN1311.TVdN141 IMPP14IITYT141 IBrol51 ITTYT15I . Namely, a plausible 
assumption in complexity theory leads to the impossibility of efficient sampling by a classical computer according 
to an output probability distribution generatable with a quantum computing model. Among them are the IQP 
model HBJSlll and the Boson sampling IIAA13L both of which are proved hard for classical computers to simulate 
within multiplicative error, unless the polynomial-time hierarchy collapses to the third level (in fact, the main result 
of Ref. BAA 1311 is a much more meaningful hardness result on the weak simulatability of the Boson sampling within 
polynomially small additive error, but which needs a much stronger complexity assumption than the collapse of 
polynomial-time hierarchy). 

An interesting question to ask is whether a similar result holds even for the DQCl model. Very recently, Mori- 
mae, Eujii, and Eitzsimons BMEE141I approached to answering the question. They focused on the DQCl^-type 
computation, the generalization of the DQCl model that allows m output qubits to be measured at the end of the 
computation, and proved that a DQCl^-type computation with m > 3 cannot be simulated within multiplicative 
error unless the polynomial-time hierarchy collapses to the third level. Their proof essentially shows that any 
PostBQP circuit can be simulated by a DQCl 3 -type computation, where PostBQP is the complexity class corre¬ 
sponding to bounded-error quantum polynomial-time computations with postselection, which is known equivalent 
to PP BAarOSl . By an argument similar to that in Ref. BBJSlll . it follows that PP is in PostBPP (the version 
of BPP with postselection), if the DQC 13 -type computation is classically simulatable within multiplicative error. 
Together with Toda’s theorem llTod9H . this implies the collapse of the polynomial-time hierarchy to the third level. 

One obvious drawback of the existing argument above is an inevitable postselection measurement inherent to 
the definition of PostBQP. This becomes a quite essential obstacle when trying to extend this argument to the 
DQCl model, where only one qubit is allowed to be measured. 

To deal with the DQCl model, this paper takes a different approach by considering the complexity class NQP 
introduced in Ref. IIADH97B or the class SBQP introduced in Ref. |Kupl5|. Eet NQj^jP and SBQj^jP be the 
variants of NQP and SBQP, respectively, in which the quantum computation performed is restricted to the DQCl 
computation (the precise definitions of NQj^jP and SBQj^jP will be found in Subsection 13.41) . Prom one of the 
technical tools used for proving the main results of this paper (the One-Clean-Qubit Simulation Procedure 
in Subsection 14.Ill , it is almost immediate to show the following theorem that states that the restriction to the DQCl 
computation does not change the complexity classes NQP and SBQP. 
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Theorem 7. NQP = NQ^^jP and SBQP = SBQjijP. 

If any DQCl computation were classically simulatable within multiplicative error, however, the class NQjj^jP 
would be included in NP and the class SBQ^^^P would be included in SBP, where SBP is a classical version of 
SBQP in short, introduced in Ref. liBGMOhl . Similarly, if any DQCl computation were classically simulatable 
within exponentially small additive error, both NQjj^jP and SBQj^jP would be included in SBP. Combined with 
Theorem|7j any of the inclusions NQjj^jP C NP, SBQjj^jP C SBP, and NQjj^jP C SBP further implies an implau¬ 
sible consequence that PH = AM, which in particular implies the collapse of the polynomial-time hierarchy to the 
second level. Accordingly, the following theorem holds. 

Theorem 8. The DQCl model is not classically simulatable either within multiplicative error or exponentially 
small additive error, unless PH = AM. 

1.4 Organization of the Paper 

Section |2] overviews the proofs of the error reduction results, the main results of this paper. Section [3] summarizes 
the notions and properties that are used throughout this paper. Section IHprovides detailed descriptions and rigorous 
analyses of three key technical tools used in this paper. Section |5] then rigorously proves Theorems [T] and |2] and 
Corollary [3l the error reduction results in the cases of one-sided bounded error. Section |6] treats the two-sided- 
bounded-error cases, by first providing one more technical tool, and then rigorously proving Theorems |4] and |5] 
The completeness results (Theorem O and the results related to the hardness of weak classical simulations of the 
DQCl model (Theorems |7] and [H) are proved in Sections |7] and [H respectively. Finally, Section |9] concludes the 
paper with some open problems. 

2 Overview of Error Reduction Results 

This section presents an overview of the proofs for the error reduction results. First, Subsection 12.11 provides 
high-level descriptions of the proofs of Theorems [T] and |2l the theorems for the one-sided error case of perfect 
completeness. Compared with the two-sided-error case, the proof construction is relatively simpler in the perfect¬ 
completeness case, but already involves most of key technical ingredients of this paper. Subsection l2.2l then explains 
the further idea that proves Theorems ID and |5l the theorems for the two-sided-error case. 

2.1 Proof Ideas of Theorems [D and |2] 

Let A = (Ayes, Ano) be any problem in QiogP(l,s), where the function s defining fhe soundness is bounded 
away from one by an inverse-polynomial, and consider a polynomial-lime uniformly generaled family of quanlum 
circuifs fhal puls A in QiogP(l,s). Lei Qx denote the quantum circuit from this family when the input is x, 
where Qx acts over tc(|x|) qubits for some polynomially bounded function w, and is supposed to be applied to the 
initial state (g) exactly k{\x\) clean qubits, for some logarithmically 

bounded function k. 

Theorems [U and |2] are proved by constructing circuits with desirable properties from the original circuit Qx- 
The construction is essentially the same for both of the two theorems and consists of three stages of transforma¬ 
tions of circuits: The first stage reduces the number of necessary clean qubits to just one, while keeping perfect 
completeness and soundness still bounded away from one by an inverse-polynomial. The second stage then makes 
the acceptance probability of no-instances arbitrarily close to 1/2, still using just one clean qubit and keeping per¬ 
fect completeness. Here, it not only makes the soundness (i.e., the upper bound of the acceptance probability of 
no-instances) close to 1/2, but also makes the acceptance probability of no-instances at least 1/2. Finally, in the 
case of Theorem |2l the third stage further reduces soundness error to be polynomially small with the use of just 
one clean qubit, while preserving the perfect completeness property. If one more clean qubit is available, the third 
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stage can make soundness exponentially small with keeping perfect completeness, which leads to Theorem [T] The 
analyses of the third stage effectively use the fact that the acceptance probability of no-instances is close to 1/2 
after the transformation of the second stage. The rest of this subsection sketches the ideas that realize each of these 
three stages. 

One-Clean-Qubit Simulation Procedure The first stage uses a procedure called the One-Clean- 
Qubit Simulation Procedure. Given the quantum circuit Qx with a specification of the number A:(|x|) of 
clean qubits, this procedure results in a quantum circuit Rx such that the input state to Rx is supposed to contain 
just one clean qubit, and when applied to the one-clean-qubit initial state, the acceptance probability of Rx is 
still one if x is in Ayes, while it is at most 1 — 5(|x|) if x is in A^o, where 6 is an inverse-polynomial function 
determined by <5 = 2“^(1 — s). It is stressed that the One-Clean-Qubit Simulation Procedure preserves 
perfect completeness, which is in stark contrast to the straightforward method of one-clean-qubit simulation. 

Consider the A:(|x|)-clean-qubit computation performed with Qx- Let Q denote the quantum register con¬ 
sisting of the k{\x\) initially clean qubits, and let R denote the quantum register consisting of the remaining 
r(;(|x|) — k{\x\) qubits that are initially in the totally mixed state. Further let denote the single-qubit quantum 
register consisting of the first qubit of Q, which corresponds to the output qubit of Qx- In the one-clean-qubit 
simulation of Qx by Rx, the k{\x\) qubits in Q are supposed to be in the totally mixed state initially and Rx tries to 
simulate Qx only when Q initially contains the clean all-zero state. To do so, Rx uses another quantum register Q 
consisting of just a single qubit, and this qubit in Q is the only qubit that is supposed to be initially clean. 

For ease of explanations, assume for a while that all the qubits in Q are also initially clean even in the case 
of Rx- The key idea in the construction of Rx is the following simulation of Qx that makes use of the phase- 
flip transformation: The simulation first applies the Hadamard transformation H to the qubit in 0 and then flips 
the phase if and only if the content of Q is 1 and the simulation of Qx results in rejection (which is realized 
by performing Qx to (Q, R) and then applying the controlled-Z transformation to (Q, where the content 1 

in is assumed to correspond to the rejection in the original computation by Qx)- The simulation further 
performs the inverse of Qx to (Q, R) and again applies the Hadamard transformation H to Q. At the end of the 
simulation, the qubit in Q is measured in the computational basis, with the measurement result 0 corresponding 
to acceptance. The point is that this phase-flip-based construction provides a quite “faithful” simulation of Qx, 
meaning that the rejection probability of the simulation is polynomially related to the rejection probability of the 
original computation of Qx (and in particular, the simulation never rejects when the original computation never 
rejects, i.e., the simulation preserves the perfect completeness property). 

As mentioned before, all the qubits in Q are supposed to be in the totally mixed state initially in the one- 
clean-qubit simulation of Qx by Rx, and Rx tries to simulate Qx only when Q initially contains the clean all-zero 
state. To achieve this, each of the applications of the Hadamard transformation is replaced by an application of 
the controlled-Hadamard transformation so that the Hadamard transformation is applied only when all the qubits 
in Q are in state |0). By considering the one-clean-qubit computations with the circuit family induced by Rx, the 
perfect completeness property is preserved and soundness is still bounded away from one by an inverse-polynomial 
(although the rejection probability becomes smaller for no-instances by a multiplicative factor of 2“^, where no¬ 
tice that 2~^ is an inverse-polynomial as A: is a logarithmically bounded function). The construction of Rx is 
summarized in Figure [T] A precise description and a detailed analysis of the One-Clean-Qubit Simulation 
Procedure will be presented in Subsection 14. II 

Randomness Amplieication Procedure The second stage uses the procedure called the Randomness 
Amplieication Procedure. Given the circuit Rx constructed in the first stage, this procedure results in a 
quantum circuit R'x such that the input state to R'x is still supposed to contain just one clean qubit, and when 
applied to the one-clean-qubit initial state, the acceptance probability of R'x is still one if x is in Ayes, while it is in 
the interval ^ e(|x|)] if x is in Ano for some sufficiently small function e- 
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One-Clean-Qubit Simulation Procedure — Simplified Description 


1. Prepare a single-qubit register 0, a /c(|a:|)-qubit register Q, and a (rt;(|x|) — /c(|x|))-qubit register R, where 
the qubit in 0 is supposed to be initially in state |0), while all the qubits in Q and R are supposed to be 
initially in the totally mixed state //2. 

Apply if to 0 if all the qubits in Q are in state |0). 

2. Apply to (Q, R). 

3. Apply the phase-flip (i.e., multiply the phase by —1) if the eontent of (0, is 11, where denotes the 
single-qubit register eonsisting of the first qubit of Q. 

4. Apply qI to (Q, R). 

5. Apply if to 0 if all the qubits in Q are in state |0). Measure the qubit in 0 in the eomputational basis. Aeeept 
if this results in |0), and rejeet otherwise. 


Figure 1: The One-Clean-Qubit Simulation Procedure indueed by a quantum eireuit Qx with the speei- 
fieation of the number A:(|x|) of elean qubits used in the eomputation of Qx to be simulated (a slightly simplified 
deseription). 


Consider the one-elean-qubit eomputation performed with Rx- Let 0 denote the single-qubit quantum register 
eonsisting of the initially elean qubit, whieh is also the output qubit of Rx- Let R denote the quantum register 
eonsisting of all the remaining qubits that are initially in the totally mixed state (by the eonstruetion of R 
eonsists of m(|x|) qubits). 

Suppose that the qubit in 0 is measured in the eomputational basis after Rx is applied to the one-elean-qubit 
initial state |0)(0| (8> in (0, R). Obviously from the property of Rx, the measurement results in 0 with 

probability exaetly equal to the aeeeptanee probability pacc of the one-elean-qubit eomputation with Rx- Now sup¬ 
pose that Rx is applied to a slightly different initial state |1)(1| (8> in (0, R), where 0 initially eontains 

|1) instead of |0) and all the qubits in R are again initially in the totally mixed state. The key property here to 
be proved is that, in this ease, the measurement over the qubit in 0 in the eomputational basis results in 1 again 
with probability exaetly Pa,cc, the aeeeptanee probability of the one-elean-qubit eomputation with Rx- This implies 
that, after the applieation of Rx to (0, R) with all the qubits in R being in the totally mixed state, the eontent of 
0 remains the same with probability exaetly Pa,cc, and is flipped with probability exaetly 1 — pacc? the rejeetion 
probability of the original one-elean-qubit eomputation with Rx, regardless of the initial eontent of 0. 

The above observation leads to the following eonstruetion of the eireuit R'x- The eonstruetion of R'x is basieally 
a sequential repetition of the original eireuit Rx- The number N of repetitions is polynomially many with respeet 
to the input length |x|, and the point is that the register 0 is reused for eaeh repetition, and only the qubits in R 
are refreshed after eaeh repetition (by preparing N registers Ri,..., Rtv, eaeh of whieh eonsists of t(;(|x|) qubits, 
the same number of qubits as R, all of whieh are initially in the totally mixed state). After eaeh repetition the 
qubit in 0 is measured in the eomputational basis (in the aetual eonstruetion, this step is exaetly simulated without 
any measurement — a single-qubit totally mixed state is prepared as a fresh aneilla qubit for eaeh repetition so 
that the eontent of 0 is eopied to this aneilla qubit using the CNOT transformation, and this aneilla qubit is never 
touehed after this CNOT applieation). Now, no matter whieh measurement result is obtained at the jth repetition 
for every j in {!,... ,N}, the register 0 is reused as it is, and the eireuit Rx is simply applied to (0, Rj+i) at the 
(j -|- l)st repetition. After the N repetitions, the qubit in 0 is measured in the eomputational basis, whieh is the 
output of R'x (the output 0 eorresponds to aeeeptanee). 
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Randomness Amplification Procedure — Simplified Description 

1. Prepare a single-qubit register 0, where the qubit in 0 is supposed to be initially in state |0). Prepare a 
r(;(|x|)-qubit register Rj for each j in {1,..., where all the qubits in Rj are supposed to be initially in 
the totally mixed state 1/2. 

2. For y = 1 to N, perform the following: 

2.1. Apply Rx to (0, Rj). 

2.2. Measure the qubit in 0 in the computational basis. 

3. Accept if the qubit in 0 is in state |0), and reject otherwise. 


Figure 2: The RANDOMNESS Amplification Procedure (a slightly simplified description). 


The point is that at each repetition, the content of 0 is flipped with probability exactly equal to the rejection 
probability of the original one-clean-qubit computation of Rx- Taking into account that 0 is initially in state |0), 
the computation of R'^ results in acceptance if and only if the content of 0 is flipped even number of times during 
the N repetitions. An analysis on Bernoulli trials then shows that, when the acceptance probability of the original 
one-clean-qubit computation of Rx was in the interval [^, l), the acceptance probability of the one-clean-qubit 
computation of R'x is at least 1/2 and converges linearly to 1/2 with respect to the repetition number. On the other 
hand, when the acceptance probability of the original Rx was one, the content of 0 is never flipped during the 
computation of R'x, and thus the acceptance probability of remains one. Figure |2] summarizes the construction 
of R'x- A precise description and a detailed analysis of the RANDOMNESS Amplification Procedure will be 
presented in Subsection 14.21 

Stability Checking Procedures In the case of Theorem[2l the third stage uses the procedure called the 
One-Clean-Qubit Stability Checking Procedure. Given the circuit R'^ constructed in the second stage, 
this procedure results in a quantum circuit R'/ such that the input state to R'/ is still supposed to contain just one 
clean qubit, and when applied to the one-clean-qubit initial state, the acceptance probability of R'/ is still one if x 
is in Ayes, while it is 1/p{\x\) if x is in A^o for a polynomially bounded function p predetermined arbitrarily. 

Consider the one-clean-qubit computation performed with R'x- Let Q denote the single-qubit quantum register 
consisting of the initially clean qubit, which is also the output qubit of Let R denote the quantum register 
consisting of all the remaining qubits that are initially in the totally mixed state, and let n;'(|x|) denote the number 
of qubits in R. 

Again the key observation is that, after the application of to (Q, R) with all the qubits in R being in the 
totally mixed state (followed by the measurement over the qubit in Q in the computational basis), the content of 
Q is flipped with probability exactly equal to the rejection probability of the original one-clean-qubit computation 
with R'^, regardless of the initial content of Q. 

This leads to the following construction of the circuit R!/. The construction of R!/ is again basically a sequential 
repetition of the original circuit R'^, but this time the qubit in Q is also supposed to be initially in the totally 
mixed state. The circuit R!x is repeatedly applied 2N times, where is a power of two and is polynomially 
many with respect to the input length |x|, and again the register Q is reused for each repetition, and only the 
qubits in R are refreshed after each repetition (by preparing 2N registers Ri,..., R 2 N, each of which consists of 
m'(|x|) qubits, all of which are initially in the totally mixed state). The key idea for the construction of R'/ is to use 
a counter that counts the number of attempts such that the measurement over the qubit in Q results in |1) after the 




One-Clean-Qubit Stability Checking Procedure — Simplified Description 

1. Given a positive integer N that is a power of two, prepare a counter C whose value is taken modulo 2N. 

Choose an integer r from {0,..., — 1} uniformly at random, and initialize a counter C to r (which sets 

the most significant bit of C to 0). Prepare a single-qubit register Q and a ■(/;'(|x|)-qubit register Rj for each 
j in {1,, 2N}, where all the qubits in Q and Rj are supposed to be initially in the totally mixed state I /2. 

2. For y = 1 to 2N, perform the following: 

2.1. Apply R'^ to (Q, Rj). 

2.2. Measure the qubit in Q in the computational basis. If this results in 11), increase the counter C by one. 

3. Reject if < C < 2N — 1, and accept otherwise (i.e., accept iff the most significant bit of C is 0). 


Figure 3: The One-Clean-Qubit Stability Checking Procedure (a simplified descripfion). 

application of R'^ (again each measuremenf is simulafed by a CNOT applicafion using an ancilla qubif of a fofally 
mixed slate). Notice lhaf Ihe conlenl of Q is never flipped regardless of fhe initial conlenl of Q, if fhe original 
accepfance probabilily is one in fhe one-clean-qubil compufafion wifh R'^. Hence, in Ihis case fhe counfer value 
eifher slafionarily remains ifs initial value or is increased exaclly by 2N, fhe number of repelifions. On fhe ofher 
hand, if fhe original accepfance probabilily is close lo 1/2 in fhe one-clean-qubil compufafion wifh R'^, fhe conlenl 
of Q is flipped wifh probabilily close lo 1/2 afler each applicafion of R'^ regardless of fhe inifial conlenl of Q. This 
means lhaf, afler each applicafion of R'^, fhe measuremenf over fhe qubif in Q resulfs in 11) wifh probabilily close 
lo 1/2 regardless of fhe inifial conlenl of Q, and Ihus, fhe incremenl of fhe counter value musl be dislribuled around 
i • 2N = N wifh very high probabilily. Now, if fhe counfer value is laken modulo 2N and if fhe unique inilially 
clean qubif is prepared for fhe mosl significanl bil of fhe counfer (which picks fhe initial counfer value from fhe 
sel {0,..., N — 1} uniformly al random), fhe compufalional-basis measuremenf over Ihis mosl significanl qubif 
of fhe counter always resulfs in |0) if x is in Ayes, while if resulfs in |1) wifh very high probabilily if x is in A^o 
(which can be made al leasl 1 — for an arbilrarily chosen polynomially bounded function p, by faking an 
appropriately large number 2N of fhe repetition). Figure [3] summarizes fhe conslruclion of R". 

One drawback of fhe conslruclion of R" above via fhe One-Clean-Qubit Stability Checking Proce¬ 
dure is lhaf, in fhe case of no-inslances, Ihere inevilably exisl some “bad” inifial counter values in {0,..., N — 1} 
wifh which R” is forced lo accepl wifh unallowably high probabilily. For inslance, if fhe inifial counfer value is 0, 
R" is forced lo accepl when fhe incremenl of fhe counfer is less fhan N, which happens wifh probabilily al leas! 
a conslanl. This is fhe essential reason why fhe currenl approach achieves only a polynomially small soundness in 
fhe one-clean-qubil case in Theorem|2l as fhe number of possible inifial counfer values can be al mosl polynomially 
many (ofherwise fhe number of repelifions musl be super-polynomially many) and even jusl one “bad” initial value 
is problemalic lo go beyond polynomially small soundness. In conlrasf, if nol jusl one buf fwo clean qubifs are 
available, one can remove fhe possibilily of “bad” initial counfer values, which resulfs in fhe Two-Clean-Qubit 
Stability Checking Procedure. This time, Ihe circuil R'^ is repeatedly applied 8N times, and fhe counter 
value is laken modulo 8N. The fwo initially clean qubifs are prepared for fhe mosl and second-mosl significanl bifs 
of fhe counfer, which resulfs in picking fhe inifial counfer value from fhe sel {0,... , 2N — 1} uniformly al random. 
Now fhe poinl is lhaf fhe counfer value can be increased by N before fhe repelifion so lhaf fhe aclual inifial value 
of fhe counter is in {N ,..., 3N — 1}, which discards fhe fail sefs {0,..., N — 1} and {3A^,..., AN — 1} of fhe 
sel {0,..., AN — 1}. As fhe size of fhe fail sefs discarded is sufficienlly large, Ihere no longer exisls any “bad” 
initial counter value, which leads lo fhe exponenlially small soundness in fhe Iwo-clean-qubil case in Theorem [T] 
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Precise descriptions and detailed analyses of the One-Clean-Qubit Stability Checking Procedure 
and Two-Clean-Qubit Stability Checking Procedure will be presented in Subsection l43l 

2.2 Proof Ideas of Theorems IH and |5] 

The results for the two-sided error case need more complicated arguments and is proved in eight stages of transfor¬ 
mations in total, which are split into three parts. 

The first part consists of three stages, and proves that any problem solvable with constant completeness and 
soundness using logarithmically many clean qubits is also solvable with constant completeness and soundness using 
just one clean qubit. At the first stage of the first part, by a standard repetition with a threshold-value decision, one 
first reduces errors to be sufficiently small constants, say, completeness 15/16 and soundness 1/16. For this, if 
the starting computation has a constant gap between completeness and soundness, one requires only a constant 
number of repetitions, and thus, the resulting computation still requires only logarithmically many clean qubits. 
The second stage of the first part then reduces the number of clean qubits to just one. The procedure in this stage 
is exactly the One-Clean-Qubit Simulation Procedure developed in the first stage of the one-sided error 
case. The gap between completeness and soundness becomes only an inverse-polynomial by this transformation, 
but the point is that the gap is still sufficiently larger (i.e. a constant times larger) than the completeness error. Now 
the third stage of the first part transforms the computation resulting from the second stage to the computation that 
still uses only one clean qubit and has constant completeness and soundness. The procedure in this stage is exactly 
the Randomness Amplification Procedure, developed in the second stage of the one-sided error case, and 
it makes use of the difference of the rates of convergence to 1/2 of the acceptance probability between the yes- and 
no-instance cases. The precise statement corresponding to the first part is found as Lemma [Ml in Subsection 16.21 

The second part consists of two stages, and proves that any problem solvable with constant completeness 
and soundness using just one clean qubit is also solvable with almost-perfect (i.e., exponentially close to one) 
completeness and soundness below 1/2 using just logarithmically many clean qubits. At the first stage of the 
second part, one reduces both of the completeness and soundness errors to be polynomially small, again by a 
standard repetition with a threshold-value decision. Note that the computation resulting from the first part requires 
only one clean qubit. Thus, even when repeated logarithmically many times, the resulting computation uses just 
logarithmically many clean qubits, and achieves polynomially small errors. The second stage of the second part 
then repeatedly attempts the computation resulting from the first stage polynomially many times, and accepts if 
at least one of the attempts results in acceptance (i.e., takes OR of the attempts). A straightforward repetition 
requires polynomially many clean qubits, and to avoid this problem, after each repetition one tries to recover the 
clean qubits for reuse by applying the inverse of the computation (the failure of this recovery step is counted as an 
“acceptance” when taking the OR). This results in a computation that still requires only logarithmically many clean 
qubits, and has completeness exponentially close to one, while soundness is still below 1/2. The precise statement 
corresponding to the second part is found as Lemma [25] in Subsection 16.21 

Now the third part is essentially the same as the three-stage transformation of the one-sided error case. From 
the computation resulting from the second part, the first stage of the third part decreases the number of clean qubits 
to just one, via the One-Clean-Qubit Simulation Procedure. The completeness of the resulting computa¬ 
tion is still exponentially close to one and its soundness is bounded away from one by an inverse-polynomial. The 
second stage of the third part then applies the RANDOMNESS Amplification Procedure to make the accep¬ 
tance probability of no-instances arbitrarily close to 1/2, while keeping completeness exponentially close to one. 
Finally, the third stage of the third part proves that one can further decrease soundness error to be polynomially 
small using just one qubit via the One-Clean-Qubit Stability Checking Procedure, or to be exponen¬ 
tially small using just two qubits via the Two-Clean-Qubit Stability Checking Procedure, while keeping 
completeness exponentially close to one. 

By considering the complement problem, the above argument can also prove the case of exponentially small 
soundness error in Theorem [5] 
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3 Preliminaries 


Throughout this paper, let N and denote the sets of positive and nonnegative integers, respectively, and let 
S = {0,1} denote the binary alphabet set. A function /: —>• N is polynomially bounded if there exists a 

polynomial-time deterministic Turing machine that outputs on input 1”. A function /: Z+ —)• N is loga¬ 
rithmically bounded if / is polynomial-time computable and /(n) is in O(logn). A function /: [0,1] is 

negligible if, for every polynomially bounded function g: Z+ N, it holds that /(n) < l/g{n) for all but finitely 
many values of n. 


3.1 Useful Properties on Random Variables 

This subsection presents two lemmas on properties of the random variable that follows the binomial distribution, 
which are used in this paper. 

The first lemma is a special case of the Hoeffding inequality. 


Lemma 9. For any n in N and p in [0,1], let X be a random variable over {0,... , n} that follows the binomial 
distribution B{n,p). Then, for any 5 in (0,1), 


Pr 


’V 

< and Pr 

r A _1 

— > p 5 

lA 

1 

C>1 

n 


n 


< e 


—25^n 


The second lemma is on the probability that a random variable takes an even number when it follows the 
binomial distribution. 

Lemma 10. For any n in N and p in [0,1], let X be a random variable over {0,... , n} that follows the binomial 
distribution B{n,p). Then, 

Pr[X is even] ~ ^ “ 2p)". 


Proof For each j in {1,..., n}, let Yj be an independent random variable over {—1,1} that takes —1 with proba¬ 
bility p, and let Z be a random variable over {0,1} defined by 


Z = 



Notice fhaf Z is 1 if and only if fhere are even number of indices j such fhaf Yj is —1. Hence, 


Pr|X is even] = E[Z] = } + } E 


n^j 


Lj=i 


11 11 
?+5 n EKi=~+- 2 p)”. 


1=1 


where fhe fhird equably uses Ihe facl fhaf each Yj is an independenl random variable, and Ihe claim follows. □ 


3.2 Quantum Fundamentals 

We assume Ihe reader is familiar wilh fhe quanlum formalism, including pure and mixed quanlum slafes, density 
operators, and measuremenls, as well as fhe quanlum circuil model (see Refs. llNC001[KSV021IWill3L for inslance). 
This subsection summarizes some nofafions and properlies fhaf are used in Ihis paper. 

For every posifive inleger n, lei C(S’*) denole fhe 2*^-dimensional complex Hilberl space whose slandard basis 
vectors are indexed by fhe elemenls in YY. In fhis paper, all Hilberl spaces are complex and have dimension a 
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power of two. For a Hilbert space T-L, let I'M. denote the identity operator over T-L, and let U(^) denote the set of 
unitary operators over T-L. As usual, let 


X = 




denote the Pauli operators, and let 


H = 


J_A 1 

V2 U -1 


S = 


1 0 
0 i 


T = 


denote the Hadamard, i-phase-shift, and T operators, respectively (the T operator corresponds to the vr/S-gate). 
Notice that S = T'^ and Z = 3“^ = T^. For convenience, we may identify a unitary operator with the unitary 
transformation it induces. In particular, for a unitary operator U, the induced unitary transformation is also denoted 
by U. 

For a Hilbert space T-L and a unitary operator U in U(^), let A(C/) denote the controlled-U operator in 
U(C(S)(g)^) defined by 

A{U) = |0)(0|®/« + |l)(l|®[/. 

For any positive integer n> 2, the n-controlled-U operator in U(C(S”) (g) T-C) is recursively defined by 

A^{U) = A(A-i(C/)) = |0)(0| ® ® 

where A^(C/) may be interpreted as the controlled-f7 operator A{U). In the case where (7 is a unitary transforma¬ 
tion over a single qubit, notice that the last qubit is the target qubit for each of the unitary transformations A{U) 
and A”(C/) in the notation above. For notational convenience, for each positive integer n and for each integer j 
in {1,... , n + 1}, let A”(C/) denote the case of the n-controlled-f7 operator in which the corresponding transfor¬ 
mation uses the jth qubit as the target qubit. The operator A”(C/) corresponds to A"^^(C/) in this notation. The 
operator A}(C/) may be simply denoted by Ai(C/). 


3.3 Quantum Circuits 

A quantum circuit is specified by a series of quantum gates with designation of qubits to which each quantum gate 
is applied. It is assumed that any quantum circuit is composed of gates in some reasonable, universal, finite set of 
quantum gates. A description of a quantum circuit is a string in S* that encodes the specification of the quantum 
circuit. The encoding must be a “reasonable” one, i.e., the number of gates in a circuit encoded is not more than 
the length of the description of that circuit, and each gate of the circuit is specifiable by a deferministic procedure 
in time polynomial with respect to the length of the description. 

A family {Qx}xeE* of quantum circuits is polynomial-time uniformly generated if there exists a polynomial¬ 
time deterministic Turing machine that, on every input x in S*, outputs a description of Qx- For convenience, we 
may identify a circuit Qx with the unitary operator it induces. 

Remark. Notice that the input x is “hard-coded” in the generated circuit Qx in the above definition of polynomial¬ 
time uniformly generated family of quantum circuits, i.e., each circuit Qx depends on the input x itself. The choice 
of this “hard-coded” definition is just for ease of explanations, and all the results in this paper do remain valid 
even with a more standard definition of the polynomial-time uniformity where each circuit generated depends only 
on the input length \x\, and the input x is given to the circuit as a read-only input. In fact, all the results still 
remain valid even when using the logarithmic-space uniformly generated family of quantum circuits to define fhe 
complexity classes in Subsection l3.4l (bv suitably replacing polynomial-time computable functions by logarithmic- 
space computable functions in some statements and by changing the definitions of polynomially and logarithmically 
bounded functions by using logarithmic-space deterministic Turing machines and logarithmic-space computable 
functions). See Subsection 13. 5l for further discussions on the uniformity of quantum circuits. 
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This paper assumes a gate set that includes the Hadamard, T, and CNOT gates. Note that this assumption is 
satisfied by many standard gate sets, and is very reasonable and not restrictive. In particular, the gate set proposed 
in Ref. iBMP~*~00l exactly consists of these three gates. Some useful transformations are in order that are exactly 
implementable with such a gate set using or not using ancilla qubits: 


Transformations corresponding to Clifford-group operators First note that, with a gate set satisfying the 
assumption above, any transformation corresponding to a Clifford-group operator is exactly implementable without 
using any ancilla qubits. 

In particular, the phase-flip transformation Z is nothing but T^, and is easily realized. Thus, the NOT transfor¬ 
mation (the operator X) is also easily realizable, for X = HZH. 

As Z = HXH, the controlled-Z transformation A(Z) is also realizable by using the decomposition 

A(Z) = {I (Z H) A{X){I (Z H), 

where A(X) is nothing but the CNOT transformation. 


Generalized Toffoli transformations Using some ancilla qubits, any generalized Toffoli transformation (i.e., the 
n-controlled-NOT transformation A'^{X) for any positive integer n), is also exactly implementable with the gate set 
discussed, according to the constructions in Ref. ||BBC'*~95|| . In the construction in Lemma 7.2 of Ref. ||BBC'*~95|| . 
the number of necessary ancilla qubits grows linearly with respect to the number of control qubits, and the con¬ 
struction in Corollary 7.4 of Ref. ||BBC'*~95l uses only two ancilla qubits when n > 5. One very helpful property 
is that no initializations are required for all these ancilla qubits (and thus, all of them can actually be re-used when 
applying other generalized Toffoli transformations). In particular, even totally mixed states may be used for these 
ancilla qubits, and hence, with the above-mentioned gate set, generalized Toffoli transformations may be assumed 
available freely when constructing quantum circuits that are used for A:-clean-qubit computations defined formally 
in Subsection 13.41 See Lemma 7.2 and Corollary 7.4 of Ref. llBBC^95t for details. 


Controlled-Hadamard transformations Recall that the Hadamard operator H is decomposed as 

H = S^HT^XTHS. 

This implies that the n-controlled-Hadamard transformation A” (77) for any positive integer n is decomposed as 

A”(77) = (7®*" ® 5^) (7®*" ® 77) (7®" 0 T^) A’^(A:) (7®" ® T) (7®” 0 77) (7®” ® S). 

As S = T^, = T^, and = T®, the A” (77) transformation for each n is exactly implementable by using two 

Hadamard gates, sixteen T gates, and one generalized Toffoli transformation (i.e., one n-controlled-NOT transfor¬ 
mation A"'{X)). Clearly, the only necessary ancilla qubits are those used for realizing the A”(X) transformation 
in this implementation. Hence, provided that generalized Toffoli transformations may be assumed available freely, 
for all n, n-controlled-Hadamard transformations may also be assumed available freely. 

Increment transformations For any positive integer n, let 7/+i(Z2") denote the increment transformation 
over Z 2 'i, which is the unitary transformation acting over n qubits defined by 

C^-l-i(^2"): \j) |(7 + 1) mod 2”'), Vj G Z2^ 


Note that 

7/+i(Z2) =X, 
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and for each positive integer n > 2, 

C/+i(Z2n) = (/®C/+i(Z2„-i)) 

where recall that corresponds to the (n — l)-controlled-NOT transformation with the first quhit being 

the target. Hence, for each positive integer n >2, 

U+i{Z2^) = (g) X) (/®(«-2) (g, Ai(X)) • • • (/ (8) A"-2(X)) A”-^(X), 

and thus, each C/+i(Z 2 ") transformation is exactly implementable by combining NOT, CNOT, and generalized 
Toffoli transformations only. Note that ancilla qubits are required only for realizing the generalized Toffoli transfor¬ 
mations in this implementation. Accordingly, provided that generalized Toffoli transformations may be assumed 
available freely, the increment transformation over Z 2 " may also be assumed available freely, for each positive 
integer n, and so may the controlled-C/+i(Z 2 n) transformation A([/+i(Z 2 ")), by its construction. 


Threshold-check transformations For any integers t and z, let />j: Z —)■ {0,1} denote the function defined by 




0 if z < t, 
1 if z > t. 


For any positive infeger n and for an infeger t in Z 2 ", The fhreshold-check fransformafion over Z 2 " is 

fhe unifary fransformafion acfing over n + 1 qubifs defined by 

f7>t(Z2n): |6) (g) \j) |6© />t(j)) ® |j)> G {0,1}, Vj G Z 2 ™. 

The fhreshold-check fransformafion [/>t(Z 2 ") over 7,2" is easily implemenfed as follows, combining fhe incremenf 
fransformafions C/+i(Z 2 n+i) and [/+i(Z 2 ") over Z 2 ™+i and Z 2 n, respectively: 

U>t{72^) = (I® t/+l(Z2n))*([/ + l(Z2n+l))"""*. 

Accordingly, provided fhaf generalized Toffoli fransformafions may be assumed available freely, fhe fhreshold- 
check fransformafion U>t (Z 2 >^) over Z 2 n may also be assumed available freely, for each positive infeger n. 


3.4 fc-Clean-Qubit Computation and Complexity Classes 

For any positive integer k, a quantum computation with k clean qubits, or simply a k-clean-qubit computation, 
is a computation performed by a unitary quantum circuit Q acting over w qubits, where m is a positive integer 
satisfying w > k. It is assumed that one of the qubits to which the circuit Q is applied is designated as the output 
qubit. The /c-clean-qubit computation specified by the circuit Q proceeds as follows. For simplicity, we identify 
the quantum circuit Q with the unitary operator it induces. The initial state of the computation is the ru-qubit state 

A"; ’ = (loxoi)®'-' ® (2) 

The circuit Q is applied to this initial state, which generates the m-qubit state 

Pfinal = 

Now the designated output qubit is measured in the computational basis, where the outcome |0) is interpreted 
as “accept”, and the outcome 11) is interpreted as “reject”. Such a computation may also be called a quantum 
computation of DQCk type, or simply a DQCk computation, in analogy to the DQCl computation. 

The complexity classes Q[j]P(c, s) and QyP(c, s) are defined as follows. 
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Definition 11. Given a function /: Z+ —>• N and functions c, s: [0,1] satisfying c > s, a promise prob¬ 

lem A = (Ayes, Ano) is in Q[j]P(c, s) iff there exists a polynomial-time uniformly generated family {Qxjxss* of 
quantum circuits such that, for every x in S*, Qx acts over ru(|x|) qubits for some polynomially bounded func¬ 
tion w: Z+ —>• N satisfying w > f and has the following properties: 

(Completeness) if x is in Ayes, the /(|x|)-clean-qubit computation induced by Qx results in acceptance with 
probability at least c(|x|), 

(Soundness) if x is in Ano, the /(|x|)-clean-qubit computation induced by Qx results in acceptance with proba¬ 
bility at most s(|x|). 

Definition 12. Given a function /: 'lA —>• N and functions c, s : lA [0,1] satisfying c > s, a promise prob¬ 
lem A = (Ayes, Ajio) is in QyP(c, s) iff A is in Q[g]P(c, s) for some function g-. TL'^ N satisfying g G 0(/). 


Using these definitions, the complexity classes BQ[j]P and BQjP are defined as follows. 

Definition 13. Given a function /: ILA ^ N, a promise problem A = (Ayes, Ano) is in BQ[j]P iff A is in 
Q[/]^(3’ s)- 

Definition 14. Given a function /: Z+ —N, a promise problem A = (Ayes, Ano) is in BQ^P iff A is in 
Q/^(3’ s)- 

Some remarks are in order on the definitions of BQjjjP and BQjP. 

By Theorem HI to be proved, for any logarithmically bounded function /, the above definition of BQjP 
is equivalent to a more conservative definition where the class consists of problems that are in QfP(l — e,e) 
for some negligible function e: lA —)• [0,1]. Similarly, for any logarithmically bounded function / > 2, the 
class BQ[jjP above is equivalent to the class of problems that are in Q[j]P(l — e,e) for some negligible func¬ 
tion e: TA [0,1]. Another candidate of definitions is to use BQ[j]P' and BQjP', which are the unions of 
Q[y]P(c, s) and QjP(c, s), respectively, over all functions c, s : lA [0,1] satisfying c — s > - for some poly¬ 
nomially bounded function p: TA —)■ N. As the computation error can be reduced by Theorem |4] only when there 
is a constant gap between completeness and soundness in the starting computation, it remains open whether these 
BQ[j]P' and BQjP' are equal to BQ[j]P and BQjP defined above. 

Finally, the complexity classes NQ[i]P and SBQjj^jP are defined as follows, which are the one-clean-qubit- 
computation analogues of NQP and SBQP, respectively. 

Definition 15. A promise problem A = (Ayes, Ano) is in NQ[^]P iff A is in Q[i]P(c, 0) for some positive-valued 
function c: TlA (0, Ij. 

Definition 16. Given a function /; lA —)• N, a promise problem A = (Ayes, Ano) is in SBQj^jP iff A is in 
Q[i]P(2“^, for some polynomially bounded function p: lA N. 

Remark. As will be proved in Subsection l8.21 the class SBQjj^jP has the following amplification property similar to 
SBQP and SBP: a problem A = (Ayes, Ano) is in SBQj^jP iff for any polynomially bounded function p : Z+ — >• N, 
there exists a polynomially bounded function q: Z+—)■ N such that A is in Q[i]P(2 • (1 — 2 p), 2 • 2 p). 

3.5 Remarks on Uniformity of Quantum Circuits 

The definition of polynomial-time uniformly generated family of quantum circuits in Subsection 13.31 allows the 
input X to be “hard-coded” in the generated circuit Qx, i.e., each circuit Qx is allowed to depend on the input x 
itself. As mentioned before, a more standard definition of the polynomial-time uniformity is the one in which each 
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circuit generated depends only on the input length |x|, and the input x is given to the circuit as a read-only input. It 
is stressed that all the results in this paper remain valid even when this more standard “non-hard-coded” definition 
is adopted for the polynomial-time uniformity of quantum circuits. It is further stressed that all the results in this 
paper still remain valid even when the complexity classes in Subsection (33 are defined wifh fhe logarithmic-space 
uniformly generated family of quanfum circuifs, by suifably replacing polynomial-lime compulable functions by 
logarilhmic-space compulable funclions in fhe stalemenls of Theorems [U and |2] and Corollary [3l and by changing 
fhe definilions of polynomially and logarilhmically bounded funclions by using logarilhmic-space deterministic 
Turing machines and logarilhmic-space compulable functions. In Ibis case, Ihe completeness resulls of Theorem 
hold even under logarilhmic-space many-one reduction. 

One Ihing lo be mentioned is, however, lhal some complexify classes defined in Subsection 13.41 such as 
QiogP(c, s), Q[i]P(c, s), BQiQgP, and BQj^jP, may depend on Ihe definition of Ihe uniformity of quanlum cir- 
cuils. Indeed, wifh fhe “hard-coded” definition of polynomial-lime uniformity, Ihe class P is Irivially conlained 
in each class defined in Subsection 13.41 whereas if becomes unclear whelher P is included in fhe bounded-error 
classes such as BQj^gP and BQj^jP when Ihe classes are defined wilh Ihe slandard “non-hard-coded” definition 
of polynomial-lime uniformily and wilh logarilhmic-space uniformity. There even exisls an oracle relative lo 
which P is nol included in BQqjP, when BQjj^jP is defined wilh logarilhmic-space uniformity llShe061IShe09l . In 
Ihis regard, when using Ihe slandard “non-hard-coded” definition of polynomial-lime uniformity, polynomial-lime 
many-one reduction may be loo powerful for Ihe models discussed in lhal Ihe reduction ilself already has compula- 
lional power enough lo solve any problem in P, while il is unclear whelher Ihe models for which Ihe completeness 
resulls are discussed have such compulalional power, allhough Theorem 0 ilself is malhemalically valid even in 
Ibis case. 

In shorl, which uniformity of eilher “hard-coded” polynomial-lime, or slandard “non-hard-coded” polynomial- 
lime, or logarilhmic-space is used lo define Ihese complexity classes does not affecl Ihe properties proved in Ihe 
presenl paper, bul may affecl how large Ihese complexity classes Ihemselves are. 

4 Building Blocks 

Firsl, some nolalions are summarized lhal are used Ihroughoul Ibis section. 

Consider any quanlum circuil Q acting over w qubils. For any positive integer A: < m, lei PacdQ, k) denote 
Ihe acceplance probability of Ihe A:-clean-qubil compulation induced by Q. More precisely, for any positive inte¬ 
ger k < w, lei pld^^ t)e Ihe m-qubil initial slate defined by 

( T\ ^(w-k) 

y 

and lei Ilacc be Ihe projection operator defined by 

nacc = |0)(0|®/®("'-l). 

Now, for any positive integer k < w, Ihe acceplance probability Pacc(Q) k) is defined by 

Pa.cc{Q,k) = trllaccQpInif^Q^- 


4.1 Simulating fc-Clean-Qubit Computation by One-Clean-Qubit Computation 

This subsection presenls a procedure, called Ihe One-Clean-Qubit Simulation Procedure, lhal conslrucls 
anolher quanlum circuil R from Q such lhal Ihe A-clean-qubil compulation induced by Q can be simulated by Ihe 
one-clean-qubil compulation induced by R. More formally, given a description of a quanlum circuil Q (which also 
specifies Ihe number w of qubils Q acls over) and an integer k lhal specifies Ihe number of clean qubils used in 
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One-Clean-Qubit Simulation Procedure 


1. Prepare a single-qubit register 0 a /c-qubit register Q, and a{w — fc)-qubit register R, where the qubit in 0 is 
supposed to be initially in state |0), while all the qubits in Q and R are supposed to be initially in the totally 
mixed state 7/2. 

Apply the /c-controlled-77 transformation A^{H) to (0, Q) using the qubit in 0 as the target, and then apply 
the NOT transformation X to each of the qubits in Q (applying these transformations has essentially the 
same effect as conditionally applying 77 to 0 if all the qubits in Q are in state |0)). 

2. Apply Q to (Q, R). 

3. Apply the phase-flip (i.e., multiply the phase by —1) if the content of (0, ) is 11, where denotes the 

single-qubit register consisting of the first qubit of Q. 

4. Apply Qt to (Q, R). 

5. Apply X to each of the qubits in Q, and then apply A^{H) to (0, Q) using the qubit in 0 as the target 
(applying these transformations has essentially the same effect as conditionally applying 77 to 0 if all the 
qubits in Q are in state |0)). Measure the qubit in 0 in the computational basis. Accept if this results in |0), 
and reject otherwise. 


Figure 4: The One-Clean-Qubit Simulation Procedure induced by a quantum circuit Q with the specifi¬ 
cation of the number k of clean qubits used in the computation of Q to be simulated. 


the original computation to be simulated, the One-Clean-Qubit Simulation Procedure corresponds to a 
quantum circuit R such that the original A:-clean-qubit computation induced by Q is simulated by the one-clean- 
qubit computation induced by R. 

The circuit R acts over {w + 1) qubits, which are divided into three quantum registers: a single-qubit register 0, 
a A:-qubit register Q, and a{w — 7)-qubit register R. It is supposed that the qubit in 0 is initially in state |0), and all 
the qubits in Q and R are initially in the totally mixed state 7/2. Let denote the single-qubit register consisting 
of the first qubit of Q, which corresponds to the output qubit of Q. First, the circuit R applies the Hadamard 
transformation 77 to 0 if all the qubits in Q are in state |0). Next, R applies Q to (Q, R), where the qubits in Q 
correspond to the k clean qubits of the original 7-clean-qubit computation induced by Q. Now R flips the phase 
if and only if (0,Q(^)) contains 11 (i.e., R applies the controlled-Z transformation A{Z) to (0,Q(^))). R then 
applies to (Q, R), and further applies 77 to 0 if all the qubits in Q are in state |0). Finally, the qubit in 0 is 
measured in the computational basis, and R outputs the measurement result. 

For the actual construction, the first conditional application of 77 to 0 when all the qubits in Q are in state |0) is 
essentially realized by first applying the 7-controlled-77 transformation A^(77) to (0, Q) using the qubit in 0 as the 
target, and then applying the NOT transformation X to each of the qubits in Q. Similarly, the second conditional 
application of 77 to 0 is essentially realized by first applying X to each of the qubits in Q, and then applying 
A^(77) to (0, Q) using the qubit in 0 as the target. Figures |4] and [5] summarize the construction of R. 

Proposition 17. For any quantum circuit Q and any positive integer k, let R be the quantum circuit correspond¬ 
ing to the One-Clean-Qubit Simulation Procedure induced by Q and 7. For the acceptance probabil¬ 
ity Pa.cciQ,k) of the k-clean-qubit computation induced by Q and the acceptance probability Pe,cc{R-, 1) of the 
one-clean-qubit computation induced by R, it holds that 

l-2-^{l-{p^,,{Q,k)f) <Pacc(i?,l) < l-2-^(l-pacc(Q,A:)). 
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clean qubit 



Figure 5: The quantum circuit for the One-Clean-Qubit Simulation Procedure induced by a quantum 
circuit Q with a specification of the quantum register Q, where the qubit in 0 is supposed to be the only clean 
qubit at the beginning of the computation, and is also the output qubit of the computation. The combinations of 
the controlled-Hadamard transformation and the NOT transformations X essentially correspond to applying the 
Hadamard transformation if all the qubits in Q are in state 10). 


Proof. To analyze the acceptance probability Pacc{R, 1) of the one-clean-qubit computation induced by R, suppose 
that the content of Q is initially q in and the content of R is initially r in 

First consider the case where q is not all-zero. In this case, nothing is applied to the qubit in 0 in Step 1, 
and thus the phase-flip is never performed in Step 3. Therefore, the application of in Step 4 cancels out the 
application of Q in Step 2, and thus the content of Q remains q in Step 5, which is not all-zero. Hence, nothing is 
applied to the qubit in 0 in Step 5, either, and R outputs 0 and accepts. 

Now consider the case where q is all-zero. In this case, by letting Ifir) = |0)®^ (8* |r) the state in (0, Q, R) just 
after Step 2 is given by 

±{\0) + \l))^{Q\fr)). 

For each j in {0,1}, let Hj be the projection operator acting over w qubits defined by Hj = \j){j\ 0 
(notice that Hq is nothing but Hacc)- Then the state in (0, Q, R) just after Step 4 is given by 

^|o) ® \fr) + ^|1) ® [Q^(no - iii)Q\A)] 

= ^|0) ® \fr) + ^|1) ® [QU2no - 

= ^(|0) - |1)) ® \fr) + V2 |1) ® {Q^UoQlA})- 

Further define the projection operators Aq and Ai acting over w qubits by Aq = (|0)(0|)®^ (8>and 
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Ai = I®"' — Aq. Then the state in (0, Q, R) just after Step 5 is given by 

|1) ® Hr) + (|0) - |1)) ® (AoQ^UoQHr)) + V2 |1) ® (AiQtnoQIV^,)). 

Hence, the probability that R outputs 0 is given by || AoQ^HoQIV'r) ||^- 

It follows that the overall acceptance probability Pacc(f?) 1) of the one-clean-qubit computation induced by R 
is given by 

Pacc(i?,l) = (l-2-'=) + 2-".2-(—^ \\AoQ^UoQHr)f- (1) 

^^■£(w-k) 

First notice that Aq = (|0)(0|)®^ (g) = Z]r-es(“-fc) IV’r)(V’r-|, and thus, it holds that 

||Aog%Q|V^.)|| > |||V^r)(V’r-|Q^nog|V'.)|| = \Hr\Q^UoQHr)\ = IlnoQiV'.jf, 

which is exactly the acceptance probability of the circuit Q when the input state to it was Hr) = |0)'^^ (g) |r). As 
the acceptance probability Pacc{Q, k) of the /c-clean-qubit computation induced by Q is nothing but the expected 
value of ||noQ|V’r)||^ over r in it follows that 

2-(«.-fc) ^ \\AoQ^UoQHr)f 

r^T,{'<r-k) 

^L-(^-k) ^ ||AoQ%Q|V>r-)||) >( 2 -^^-^^ Y. ||noQ|^r)||'] =(Pacc(Q,A:))2. 

^ re'L('^-k) ^ 

Combined with the equation ([Til, this implies that 

Pacc(i?, 1) > (1 - 2-"=) + 2-Hp.cc{Q, k))^ = (pacc(Q, k))^) , 

which provides the first inequality. 

Now notice that || AcQ^HoQI'^r)!!^ is at most ||no(3|V’r)||^> which is again exactly the acceptance probability 
of the circuit Q when the input state to it was Hr) = |0)®*^ (g) |r). Again using the fact that Pa,cc{Q, k) is nothing 
but the expected value of ||no(5|V’?’)||^ over r in it holds that 

2 -(^_fc) ^ ||Aog%Q|V^.)f <2-(“'-'=) ^ \\UoQHr)\f =Ps.c{Q,k). 

r^Sin-k) reE(^~k) 

Combined with the equation ([T]), this implies that 

Pacc(R, !)<(!- 2-^) + 2-^Pacc(Q, k) = l- 2-^1 - Pacc(Q, k)), 

and the second inequality follows. □ 

4.2 Amplifying Randomness of One-Clean-Qubit Computation 

This subsection presents a procedure, called the RANDOMNESS Amplification Procedure, that constructs 
another quantum circuit R^^^ from Q when a positive integer N is specified. The circuit R^^^ is designed so that 
the sequence l)}jveN acceptance probability of the one-clean-qubit computation induced by 

R(^^ converges linearly to 1/2 with a rate related to the acceptance probability Pacc(Q) 1) of the one-clean-qubit 
computation induced by <3, if 0 < Pacc(Q) 1) < 1- 

For each N in N, the circuit R^^^ acts over Nw + 1 qubits, which are divided into 2N + 1 quantum registers: 
a single-qubit register 0, and a (m — l)-qubit register Rj and a single-qubit register Xj for each j in {1,..., A}. 
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Randomness Amplification Procedure 


1. Prepare a single-qubit register 0, where the qubit in 0 is supposed to be initially in state |0). Prepare a 
{w — 1)-qubit register Rj and a single-qubit register Xj, for each j in {1,..., where all the qubits in Rj 
and Xj are supposed to be initially in the totally mixed state 1/2. 

2. For y = 1 to N, perform the following: 

2.1. Apply Q to (0, Rj). 

2.2. Apply the CNOT transformation to (0, Xj) with the qubit in 0 being the control. 

3. Measure the qubit in 0 in the computational basis. Accept if this results in |0), and reject otherwise. 


Figure 6: The RANDOMNESS Amplification Procedure. 

It is supposed that the qubit in 0 is initially in state |0), and all the qubits in Rj and Xj are initially in the totally 
mixed state 1/2, for all y in {1,... , A^}. For y = 1 to N, the circuit repeats the process of first applying 
Q to (0, Rj) and then applying the CNOT transformation to (0,Xj) using the qubit in 0 as the control, where 
each application of the CNOT transformation essentially has the same effect as measuring the qubit in 0 in the 
computational basis every time after Q is applied. Finally, the qubit in 0 is measured in the computational basis, 
and outputs the measurement result. Figures [6]and|7]summarize the construction of R^^\ (Strictly speaking, 
when y = A^ at Step 2 of Figure^ it is redundant to apply CNOT at Step 2.2, as the qubit in 0 is anyway measured 
at Step 3 in the computational basis. This point is reflected in Figure |7]) 

Proposition 18. For any quantum circuit Q and any positive integer N, let R^^'i be the quantum circuit corre¬ 
sponding to the Randomness Amplification Procedure induced by Q and N. For the acceptance proba¬ 
bility PacciQi 1) of the one-clean-qubit computation induced by Q and the acceptance probability Pacc{R^^\ l) of 
the one-clean-qubit computation induced by R^^\ it holds that 

p^,,{R^^\l) = i + i(2pacc(Q, 1) - 1)"^. 


Proof For each y in {0,1}, let Ilj be the projection operator acting over w qubits defined by 
Ilj = |y)(yj ® jg(. p. quantum state of w qubits defined by pj = \ j){j\ <8) Note 

that Flo = Tlacc and po = and thus, the acceptance probability Pa,cc{Q, 1) of the original one-clean-qubit 

computation induced by Q is given by 

PucciQ, 1) = trUoQpoQ^. 

By noticing that IIi = I®"' - Flo and 

trUoQpoQ^ + tTUoQpiQ^ = tr = ^^trllo = 1, 

it also holds that 

trlliQpiQt = 1 - trlloQpiQ^ = 1 - (l - trlloQpoQ^) = trUoQpoQ^ =Pacc{Q, !)• 

This implies that, after each repetition of Step 2 of the RANDOMNESS Amplification Procedure, the content 
of 0 remains unchanged with probability p^cciQ, !)> and is flipped with probability 1 — PacciQ, 1) (by viewing 
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Figure 7: The quantum circuit for the RANDOMNESS Amplification Procedure induced by a quantum 
circuit Q, where the qubit in 0 is supposed to be the only clean qubit at the beginning of the computation, and is 
also the output qubit of the computation. 


that 0 is “measured” in the computational basis as a result of the application of CNOT at Step 2.2). The content 
of 0 is 0 when entering Step 3 if and only if the content of 0 is flipped even number of times during Step 2. By 
Lemma [TOl this happens with probability exactly ^ + ^(2pacc(Q, 1) — 1)^, which gives the acceptance probabil¬ 
ity Pace , l) of the one-clean-qubit computation induced by □ 
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4.3 Checking Stability of One-Clean-Qubit Computation 

This subsection considers two procedures of checking whether a given one-clean-qubit computation has sufficiently 
high acceptance probability or not. For a positive integer N specified, fhe firsf procedure, called fhe One-Clean- 
Qubit Stability Checking Procedure, constructs another quantum circuit from Q so that the accep¬ 
tance probability Pacc{Ri^\ l) of the one-clean-qubit computation induced by r[^'^ is polynomially small with 
respect to N when the acceptance probability was sufficiently close to 1/2 in the original one-clean-qubit computa¬ 
tion induced by Q. The second procedure, called the Two-Clean-Qubit Stability Checking Procedure, 
is a modification of the first procedure so that, using two clean qubits, the acceptance probability pacc(^ 2 ^\ 2 ) 
of the two-clean-qubit computation induced by the circuit R^^'^ constructed from Q now becomes exponentially 
small with respect to N when the acceptance probability was sufficiently close to 1/2 in the original one-clean-qubit 
computation induced by Q. 

4.3.1 Stability Checking Using One Clean Qubit 

First consider a slightly simplified version of fhe One-Clean-Qubit Stability Checking Procedure de¬ 
scribed below. 

For each in N satisfying N > 2, fhe procedure fo be consfrucfed prepares 2N + 1 quanfum regisfers: a 
single-qubif regisfer Q, and a (m — l)-qubif regisfer Rj for each j in {1,..., 2iV}. If is supposed fhaf all fhe qubifs 
in Q and Rj are initially in fhe fofally mixed sfafe 7/2, for all j in {1,..., 2A^}. The procedure also prepares an 
infeger variable C fhaf serves as a counfer, buf fhe value of C is nof necessarily initialized fo 0 af fhe beginning 
of fhe compufafion, and fhe inifial value r of (7 is chosen from fhe sef {0,... , — 1} uniformly af random. For 

y = 1 fo 2N, fhe procedure repeafs fhe process of firsf applying Q fo (Q, Rj) and fhen measuring fhe qubif in Q in 
fhe compufafional basis. Everyfime fhis measuremenf resulfs in |1), fhe value of fhe counfer C is increased by one. 
Affer fhis repetition of 2N times, fhe procedure checks fhe value of C, and rejecfs if if is befween N and 2N — 1. 

For fhe acfual consfrucfion of fhe above simplified procedure, fhe positive infeger N is chosen fo be a power 
of fwo so fhaf fhe mosf significanf bif is 0 for any infeger in fhe inferval [0, A^ — 1] and is 1 for any infeger in fhe 
inferval [A^, 2N — 1], when expressed as a binary siring using log A^ -|- 1 bifs. The acfual procedure also infroduces 
a (log A^ -|- 1)-qubif regisfer C and a single-qubif regisfer Xj for each j in {1,..., 2A^}. Only fhe firsf qubif of 
C is supposed fo be inifially in sfafe |0), and all fhe ofher qubifs used in fhe acfual procedure are supposed fo be 
initially in fhe fofally mixed sfafe 1/2. The confenf of C serves as a counfer C of fhe simplified procedure. The 
condifion fhaf only fhe firsf qubif in C is inifially in sfafe 10) and all fhe ofher qubifs in C are inifially in fhe fofally 
mixed sfafe 1/2 exacfly corresponds fo fhe process in fhe simplified procedure of randomly picking fhe inifial 
value of fhe counfer C from fhe sef {0,... , A^ — 1}. The condifional incremenf of fhe counfer value is realized 
by fhe confrolled-f7+i(Z2Ar) fransformafion A(C/+i(Z 2 Ar)). The final decision of accepfance and rejection of fhe 
simplified procedure can be done by measuring fhe firsf qubif of C in fhe compufafional basis. Each regisfer Xj 
is used fo simulafe fhe measuremenf af each repetition round j. More precisely, for each repefifion round j, fhe 
measuremenf of fhe qubif in Q in fhe compufafional basis is replaced by fhe application of a CNOT fransformafion 
fo (Q, Xj). Eiguresd] and |9] summarize fhe acfual consfrucfion of fhe One-Clean-Qubit Stability Checking 
Procedure. 

Eirsf analyze fhe lower bound of fhe accepfance probabilify of fhe one-clean-qubif compufafion induced by fhe 
quanfum circuif resulting from fhe One-Clean-Qubit Stability Checking Procedure. 

Proposition 19. For any quantum circuit Q and any positive integer N that is a power of two, let R^'^ be the quan¬ 
tum circuit corresponding to the One-Clean-Qubit STABILITY CHECKING PROCEDURE induced by Q and N. 
For the acceptance probability PacciQ^ 1) of the one-clean-qubit computation induced by Q and the acceptance 
probability Pacc{Ri^\ l) of the one-clean-qubit computation induced by R^^\ it holds that 

Pacc(-Rl^^l) ^ (Pacc(Q, 1))^^“^- 
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One-Clean-Qubit Stability Checking Procedure 

1. Given a positive integer N that is a power of two, let I = log + 1. Prepare an Z-qubit register C, a single¬ 
qubit register Q, and each {w — l)-qubit register Rj for each j in {1,, 2A^}. For each j in {1,... , 1}, let 

denote the single-qubit register corresponding to the jth qubit of C. The qubit in is supposed to be 
initially in state |0), while the qubit in for each j in {2,..., 1}, the qubit in Q, and all the qubits in Rj/ 
for each j' in {1,..., 2A^} are supposed to be initially in the totally mixed state 1/2. 

Prepare a single-qubit register Xj for each j in {1,..., 2A^}, where the qubit in each Xj is supposed to be 
initially in the totally mixed state 1/2. 

2. For y = 1 to 2N, perform the following: 

2.1. Apply Q to (Q, Rj). 

2.2. Apply the CNOT transformation to (Q, Xj) with the qubit in Q being the control. Apply the controlled- 
C/_i_i(Z 27 v) transformation A(C/_|_i(Z 2 Ar)) to (Q, C) with the qubit in Q being the control. 

3. Measure the qubit in in the computational basis. Accept if this results in |0), and reject otherwise. 


Figure 8: The One-Clean-Qubit Stability Checking Procedure. 


Proof. As in Subsection 14.21 for each j in {0,1}, let Ilj be the projection operator acting over w qubits defined 
by Ilj = \j){j\ ® and let pj be the quantum state of w qubits defined by pj = |j)(jj <8i The 

acceptance probability Pa,cc{Q, 1) of the original one-clean-qubit computation induced by Q is given by 

PucciQ, 1) = tr^oQpoQ^ 

and it holds that 

tlcYliQpiQ^ — Pajcc{,Qi !)• 

This implies that, for each repetition round during Step 2, the counter value stored in C is increased by one with 
probability 1 — Pa,cc{Q, 1) if the content of Q was 0 when entering Step 2.1, while it is increased by one with 
probability Pa,cc{Q: 1) if the content of Q was 1 when entering Step 2.1. 

Notice that, at the yth repetition round in Step 2 for j > 2, the content of Q is 1 when entering Step 2.1 if 
and only if the previous repetition round has increased the counter value in C. Hence, taking into account that the 
content of Q is initially 0 or 1 with equal probability, after all the repetition rounds of Step 2, the counter value in 
C is never increased with probability 

^(Pacc(Q, + ^(1 - Pacc(Q, 1)) (Pacc(Q, = ^(Pacc(Q, 

while it is increased by 2N with probability 

^(1 -Pacc(Q,l)) (pacc(Q,l))^^"^ + ^(pacc(Q, 1))^^ = ^(pacc(Q, 1))^^"^ 

The content of C stationarily remains its initial value r < — 1 for the first case, and it comes back to the initial 

value r < A^ — 1 for the second case. As Step 3 results in acceptance at least in these two cases, it follows that 

Pacc(M^\ l) > ^(Pacc(Q, 1))^^"^ + ^(Pacc(Q, 1))^^"^ = (Pacc(Q, 

as claimed. □ 
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Figure 9: The quantum circuit for the One-Clean-Qubit Stability Checking Procedure induced by a 
quantum circuit Q and a positive integer N, where U+i is a shorthand of the increment transformation U+i{Z 2 n)- 
The qubit in C^^^ is supposed to be the only clean qubit at the beginning of the computation, and is also the output 
qubit of the computation. 
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Proposition [T9]in particular implies that, if the original acceptance probability Pacc(Q) 1) is one, the acceptance 
probability Pacc{Ri^\ l) is also one for the circuit corresponding to the One-Clean-Qubit Stability 
Checking Procedure induced by Q and N. 

The next proposition provides an upper bound of the acceptance probability of the one-clean-qubit computation 
induced by the quantum circuit resulting from the One-Clean-Qubit Stability Checking Procedure, 
assuming that the original acceptance probability PacciQ, 1) is close to 1/2. 

Proposition 20. For any quantum circuit Q and any positive integer N that is a power of two and at least 2® = 64, 
let r[^'^ be the quantum circuit corresponding to the One-Clean-Qubit Stability Checking Procedure 
induced by Q and N. If the acceptance probability Pacc(Q) 1) of the one-clean-qubit computation induced by Q 
satisfies that ^ — e < PacdQ, 1) < ^ + s for some e in [O, |] such that3N~s -|- 4e < 1, it holds for the acceptance 
probability Pacc{Ri^\ l) of the one-clean-qubit computation induced by r[^'^ that 

p^c{Rf\l) <3N-^+4s. 


Proof As before, for each j in {0,1}, let Ilj be the projection operator acting over w qubits defined by 
rtj = \j){j\ <8) and let pj be the quantum state of w qubits defined by pj = |j)(jj (8) The 

acceptance probability Pa,cc{Q, 1) of the original one-clean-qubit computation induced by Q is given by 

PucciQ, 1) = tr^oQ/9oQ^ 


and it holds that 

tvYliQpiQ^ — PaicdQi !)• 

Notice that, for each repetition round during Step 2, the counter value in C is increased by one with probability at 
least min{pacc(Q, 1), 1 - Pacc(Q, 1)} > ^ - e and at most max{pacc(Q, 1), 1 - PacdQ, 1)} < | + e regardless 
of the content of Q being 0 or 1 when entering Step 2.1. Hence, from the Hoeffding bound (Lemma |9l), the 
probability that the total increment of the counter value in C is at most (1 — 2e — 26)N after all the 2N repetition 
rounds of Step 2 is less than for any 6 in [O, ^ — e] ■ Similarly, the probability that the total increment of 

the counter value in C is at least (1 -|- 2e -I- 26)N after all the 2N repetition rounds of Step 2 is less than 
also, for any 6 in [O, ^ — e]. It follows that, when 5 is in i “ ^ + 3^] the initial counter value r satisfies 

that 

2(e + <5)A^ - 1 < r < - 2(e + 5)N, 

after all the 2N repetition rounds of Step 2, the probability that the counter value in C is in the interval [N, 2N — 1] 
is more than 1 — > 1 — and thus, the acceptance probability at Step 3 is less than 


Af c 


O n 


By taking 4 to be 3 (which is in 3 ~ e + 3^] as e is at most 1/8 and N is at least 2® = 64), and using 

the fact that x < 2^~^ holds for any x > 2, it follows that 


Pacc(4^\ 1) < 2N-1 + 2-^^+^ +4e <3N-l+4e, 


which completes the proof. 


□ 
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Two-Clean-Qubit Stability Checking Procedure 

1. Given a positive integer N that is a power of two, let I = log + 3. Prepare an Z-qubit register C, a single¬ 
qubit register Q, and each {w — l)-qubit register Rj for each j in {1,, 8A^}. For each j in {1,... , 1}, let 

denote the single-qubit quantum register corresponding to the jth qubit of C. The qubits in and 
are supposed to be initially in state |0), while the qubit in for each j in {3,..., 1}, the qubit in Q, and all 
the qubits in Rj/ for each j' in {1,..., 8A^} are supposed to be initially in the totally mixed state 1/2. 
Prepare a single-qubit register Xj for each j in {1,..., 8A^}, where the qubit in each Xj is supposed to be 
initially in the totally mixed state 1/2. 

2. For y = 1 to N, apply the unitary transformation U+i{’Ijsn) to C. 

3. For y = 1 to 8N, perform the following: 

3.1. Apply Q to (Q, Ry). 

3.2. Apply the CNOT transformation to (Q, Xj) with the qubit in Q being the control. Apply the controlled- 
C/_i_i(Z 87 v) transformation A(C/+i(Z 87 v)) to (Q, C) with the qubit in Q being the control. 

4. Measure the qubit in in the computational basis. Accept if this results in |0), and reject otherwise. 


Figure 10: The Two-Clean-Qubit Stability Checking Procedure. 

4.3.2 Stability Checking Using Two Clean Qubits 

As can be seen in the proof of Proposition l20l one drawback of the One-Clean-Qubit Stability Checking 
Procedure when analyzing the upper bound of its acceptance probability is that there are some “bad” initial 
counter values with which the procedure is forced to accept with unallowably high probability even if the acceptance 
probability Pacc{Q, 1) of the underlying circuit Q was sufficiently close to 1/2. This is the essential reason why 
only a polynomially small upper bound can be proved on the acceptance probability of the One-Clean-Qubit 
Stability Checking Procedure in Proposition l20l As there are only polynomially many number of possible 
initial counter values, even just one “bad” initial value is unacceptable to go beyond polynomially small upper 
bounds. The authors do not know how to get rid of this barrier with the use of just one clean qubit. 

In contrast, if two clean qubits are available, one can easily modify the procedure so that it has no “bad” 
initial counter value. The idea is very simple. This time, the register C uses log -|- 3 qubits so that the counter 
takes values in Z8Ar = {0,... , 8N — 1}. The first two qubits of C are supposed to be initially in state |0), which 
implies that the initial counter value is picked uniformly from the set {0,... , 2N — 1}. The point is that one 
can increase the counter value by N before the repetition starts so that the actual initial value of the counter 
is in the set {A^, ..., 3N — 1}. If the acceptance probability Pa^dQ, 1) of the underlying circuit Q was in the 
interval ^ + e] for some sufficiently small e, the expected value of the counter after the repetition must be 

in the interval [(5 — 8e)N, (7 -f 8£)N — 1]. Hence, if e is in [O, , for instance, the probability is exponentially 

small for the event that the final counter value is not in the interval [4A^, 8N — 1]. This leads to the Two-Clean- 
Qubit Stability Checking Procedure, whose construction is summarized in Figure [TOl 

First, the following lower bound holds for the acceptance probability of the two-clean-qubit computation in¬ 
duced by the quantum circuit resulting from the Two-Clean-Qubit Stability Checking Procedure. In 
particular, if the original acceptance probability PacdQ, 1) is one, the acceptance probability Pacc(f? 2 ^\ 2) is also 
one for the circuit corresponding to the Two-Clean-Qubit Stability Checking Procedure induced 
by Q and N. 
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Proposition 21. For any quantum circuit Q and any positive integer N that is a power of two, let be the quan¬ 
tum circuit corresponding to the Two-Clean-Qubit STABILITY Checking Procedure induced by Q and N. 
For the acceptance probability Pacc(Q) 1) of the one-clean-qubit computation induced by Q and the acceptance 
probability Pacc{R^\ 2) of the two-clean-qubit computation induced by R^\ it holds that 

Pacc{R 2 ^\^) > (Pacc(Q, 1))®^"^- 


The proof of Proposition!^ is essentially the same as that of Proposition [T9l and is omitted. 

The next proposition provides an upper bound of the acceptance probability of the two-clean-qubit computation 
induced by the quantum circuit resulting from the Two-Clean-Qubit Stability Checking Procedure, 
assuming that the original acceptance probability Pacc(Q) 1) is close to 1/2. 

Proposition 22. For any quantum circuit Q and any positive integer N that is a power of two and at least 2“^ = 16, 
let r!^'^ be the quantum circuit corresponding to the TwO-CleaN-Qubit STABILITY CHECKING PROCEDURE 
induced by Q and N. If the acceptance probability PacciQ, 1) of the one-clean-qubit computation induced by Q sat¬ 
isfies that ^ — e < Pa.cc{Q, 1) < ^ + e/or some e in [O, ^], it holds for the acceptance probability pacc(^ 2 ^\ 2) 
of the two-clean-qubit computation induced by R^'^ that 

Pacc(i?f\ 2 ) < 2 -^+/ 


Proof The proof is similar to that of Proposition l20l As before, for each j in {0,1}, let Tlj be the projection 
operator acting over w qubits defined by Yij = \ j) {j\ (g) and let pj be the quantum state of w qubits defined 

by Pj = \j){j\ (g) The acceptance probability Pax^dQ, 1) of the original one-clean-qubit computation 

induced by Q is given by 

PacciQj 1) — tr ) 

and it holds that 

iiliiQpiQ^ = Pacc(Q, !)• 

Notice that, for each repetition round during Step 3, the counter value in C is increased by one with probability at 
least min{pacc(Q, 1), 1 - PacdQ, 1)} > ^ - e and at most max{pacc(Q, 1), 1 - PacdQ, 1)} < 5 + e regardless 
of the content of Q being 0 or 1 when entering Step 3.1. Hence, from the Hoeffding bound (Lemma O, the 
probability that the total increment of the counter value in C is at most (4 — 8e — 86)N after all the 8N repetition 
rounds of Step 3 is less than , for any 5 in [O, ^ — e]. Similarly, the probability that the total increment of 

the counter value in C is at least (4 -I- 8e -|- 86)N after all the 8N repetition rounds of Step 3 is less than 
also, for any 6 in [O, ^ — e]. As the counter value r in C at the beginning of Step 3 satisfies that 

N < r < 3N — 1, 


when 6 is in g “ ^ + 8^]> if holds that, after all the 8N repetition rounds of Step 3, the probability that 

the counter value in C is in the interval [4Af, 8N — 1] is more than 1 — > 1 — , and thus, the 

acceptance probability at Step 4 is less than 


1 - (1 - 2 


By taking 5 to be it follows that 


which completes the proof. 


Pacc{Rf\2) <2 


- — -II 
16^ . 


□ 
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5 Error Reduction for One-Sided-Error Cases 


This section proves the error reduction results in the cases of one-sided bounded error. First, Subsection 15 .1 1 treats 
the cases with perfect completeness, namely. Theorems [U and |2l The cases with perfect soundness (Corollary [3]) 
then easily follow from Theorems [T] and |2j as will be found in Subsection l5.2l 


5.1 Cases with Perfect Completeness 

5.1.1 One-Clean-Qubit Case 

This subsection proves Theorem O stating that any problem computable using logarithmically many clean qubits 
with one-sided bounded error of perfect completeness and soundness bounded away from one by an inverse- 
polynomial is necessarily computable using only one clean qubit with perfect completeness and polynomially 
small soundness error. 


Proof of Theorem^ For any polynomial-time computable function s: [0,1] satisfying 1 — s > | for some 

polynomially bounded function q \ N, let A = (Ayes,^no) be a problem in QiogP(l,s). Then A has 

a polynomial-time uniformly generated family {Qx}xe'E* of quantum circuits such that, for every input x, 
Pa.cc{Qx, ^(|3;|)) = 1 if X is in ^dyes and PacdQx, ^(|3;|)) < s(|x|) if x is in A^^o for some logarithmically bounded 
function k: TLA —)• N. For any polynomially bounded function p: TLA —)• N, the proof constructs a polynomial-time 
uniformly generated family of quantum circuits that puts A in Q[i]P(l, f). 

Fix an input x. 

From the circuit Qx, one first constructs a quantum circuit Rx according to the One-Clean-Qubit Simula¬ 
tion Procedure. By Proposition [TTl it holds that Pajcc{Rx, 1) = 1 if x is in Ayes, and 

1 _ 2-^(i-i) <1) < 1 - 2-"(N)(i - KkD) < 1 - 


if X is in Ano^ where ri: lA —)• N is the polynomially bounded function defined by ri = 2^q. 

From the circuit Rx, one constructs a quantum circuit R'^. according to the RANDOMNESS Amplification 
Procedure with the integer N = ri(|x|) T 2 (|t|) for a polynomially bounded function r 2 - 'lA —N satisfying 
T 2 > P + 1- By Proposition [m it holds that pacc(f?*) 1) = 1 if x is in Ayes, and 


1 . X 1 1 

— < PucciRx^ ^ “ 2 2 


rO X 


ri{\x\)r2i\x\) 


1 _|_ lg-2r2(|3:|) < 1 _|_ 


if X is in Ano- 

Finally, from the circuit R'x, one constructs a quantum circuit R” according to the One-Clean-Qubit Sta¬ 
bility Checking Procedure with N to be the smallest integer that is a power of two and at least (6p(|x|))^ 
(i.e., N = > 2® = 64). By Propositions [T^ and l20l it holds that p^cciR'd 1) = 1 if x is in Ayes, and 


Pacc(-R", 1) < 


1 

2p(|x|) 


_l_ 4. 


1 

p(|x|) 


if X is in Ano, where the last inequality follows from the facts that r 2 > p + 1 and that the inequality 2"^ > n holds. 
The claim follows with the polynomial-time uniformly generated family {R!f)x&T.* of quantum circuits. □ 


5.1.2 Two-Clean-Qubit Case 

This subsection proves Theorem [T] stating that, using two clean qubits rather than one, soundness can be made 
exponentially small. 
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Proof of TheoremU} The proof is essentially the same as that of Theorem|2j except that the circuit i?" is constructed 
according to the Two-Clean-Qubit Stability Checking Procedure rather than the One-Clean-Qubit 
Stability Checking Procedure. 

For any polynomial-time computable function s: [0,1] satisfying 1 — s > | for some polynomially 

bounded function q: —)• N, let A = (Ayes, ^no) be a problem in QiogP(l, s). Then A has a polynomial-time 

uniformly generated family {Qxjxes* of quantum circuits such that, for every input x, PacciQx, ^(|t|)) = 1 if x 
is in Ayes and Pa.cc{Qx,k{\x\)) < s(|x|) if x is in Ano for some logarithmically bounded function k: — )• N. 
For any polynomially bounded function p: Z+ —)■ N, the proof constructs a polynomial-time uniformly generated 
family of quantum circuits that puts A in Q[ 2 ]P( 1 , 2~p). 

Fix an input x. 

From the circuit Qx, one first constructs a quantum circuit according to the One-Clean-Qubit Simula¬ 
tion Procedure. By Proposition [TTl it holds that PucciRx, 1) = 1 if x is in Ayes, and 

1 _ 2-^(N) < 1) < 1 - - s(|x|)) < 1 - 

r{\x\) 

if X is in Ano, where r: Z+ —)• N is the polynomially bounded function defined by r = 2^q. 

From the circuit R^, one constructs a quantum circuit i?(, according to the RANDOMNESS Amplifica¬ 
tion Procedure with the integer N > ar(|x|), where a = |ln2 < 1.04. By Proposition [HI it holds that 
Pa.cc{R'x, 1) = 1 if X is in Ayes, and 


1 , ^ 1 1 2 \ 
2 <Pacc(i?.,l)< 2 + 


a r(|x|) 


1 1 
<2 + 2 ® 


-3 In 2 


1 1 
2 16 


if X is in Ano- 

Finally, from the circuit R'^, one constructs a quantum circuit R” according to the Two-Clean-Qubit Sta¬ 
bility Checking Procedure with N to be the smallest integer that is apower of two and at least 16(p(|x|) -|-1) 
(i.e., N = 2b°s(p(l*l)+i)l+^). If X is in Ayes, Proposition ITU ensures that pacc(-f^x^ 2) = 1- On the other hand, if x 
is in Ano, Proposition l22l ensures that 

Pacc(i?",2)<2-^’(N) 

(notice that the bounds for the probability pacc(^x) 1) above are such that Proposition l22l can be used to show the 
upper bound for the probability pacc(^x) 2))- 

The claim follows with the polynomial-time uniformly generated family of quantum circuits. □ 

5.2 Cases with Perfect Soundness 

Now Corollary [3] is immediate from Theorems [Hand [2] by considering complement problems. 

Proof of Corollary]^ For any polynomial-time computable function c; Z+ —)• [0,1] satisfying c > f- for some 
polynomially bounded function q: TLA N, let A = (Ayes, Ano) be a problem in QiogP(c, 0). Consider the 
complement problem A = (Ano, Ayes) of A. As A is in QiogP(l,(5) for the constant <5 = 1 — c satisfying 
1 — (5 = c > |, it follows from Theorems [T] and [2] that A is in Q[2]P(1, 2“^) and also in QjpP(l, |), for any poly¬ 
nomially bounded function p: Z+ —)■ N. This implies that A is in Q[2]P(1 — 2~^, 0) and also in Qji]P(l — O), 

for any polynomially bounded function p : Z+ N, as desired. □ 
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6 Error Reduction for Two-Sided-Error Cases 


This section considers the cases with two-sided bounded error. First, Subsection 16. 1 I provides one more technical 
tool, called the OR-Type Repetition Procedure, which aims to simulate the standard error-reduction method 
of a repetition with an OR-type decision. By combining this procedure with the three procedures presented in 
SectionlH Theorems |4] and [5] are proved in Subsection 16.21 

6.1 OR-Type Repetition Procedure 

Again consider any quantum circuit Q acting over w qubits, which is supposed to be applied to the fc-clean-qubit 
initial state = (|0)(0|)'^*^ (8* This section presents a procedure, called the OR-Type Repeti¬ 

tion Procedure, that constructs another quantum circuit from Q when a positive integer N is specified. 
Intuitively, the circuit is designed so that, when some additional clean qubits are available other than the 
k clean qubits that are used for the fe-clean-qubit computation induced by Q, one can perform N attempts of the k- 
clean-qubit computation induced by Q and accept if and only if at least one of these attempts results in acceptance 
in the original computation of Q. 

If Nk clean qubits were allowed to use (in addition to one clean qubit as the output qubit), the above procedure 
is easily constructed by preparing N copies of the initial state applying Q to each of these N initial states, 

and accepting if and only if at least one of the N attempts results in acceptance. The only problem in this construc¬ 
tion is that the resulting computation requires unallowably many clean qubits in its initial state, as N is desirably 
much larger than k for the purpose of error reduction of the /c-clean-qubit computations. 

To realize this OR-type repetition idea, consider the procedure described in Figure [TT] Instead of preparing 
N copies of the initial state, now the k qubits in the register Q that were initially clean are reused for each attempt. 
In order to reuse the k clean qubits, the procedure tries to initialize these qubits by applying the inverse of Q in 
Step 2.3 (and measuring each qubit in Q in the computational basis in Step 2.4). Of course, the initialization may 
not necessarily succeed every time, and the failure of the initialization is also counted as an “acceptance” in that 
attempt. The underlying idea of this procedure is essentially the same as that in the method in Ref. BWatOll used for 
reducing the computation error of one-sided bounded error logarithmic-space quantum Turing machines, where the 
phase-shift transformation is applied when the initialization attempt fails, instead of counting it as an “acceptance”. 
It is unclear whether the phase-shift trick in Ref. HWatOllI works in the present case, as one cannot perfectly judge 
in the present case whether the initialization succeeds or not, due to the existence of polynomially many qubits that 
are initially in the totally mixed states. 

The actual construction of the above simplified procedure uses a single-qubit register 0 and an f-qubit register C 
for I = [logiV] -|- 1, and further introduces a single-qubit quantum register Xj and a A:-qubit register Yj for each 
j in {1,..., A^}. The qubits in 0, C, and Q are supposed to be initially in state |0), and all the other qubits 
used in the actual procedure are supposed to be initially in the totally mixed state 1/2. The qubit in 0 serves 
as the output qubit, and the content of C serves as a counter C of the simplified procedure. Each condifional 
incremenf of fhe counfer value is realized eifher by using fhe conlrolled-f7+i(Z2!) fransformafion A(f7+i(Z2!)) 
or by combining fhe U+i{'Z 2 i) fransformafion and fhe A:-confrolled-f7+i(Z2i) fransformafion A^(C/+i(Z 2 i))- The 
decision of accepfance and rejection af Sfep 3 of fhe simplified procedure can be simulafed by combining NOT 
fransformafions and a generalized Toffoli fransformafion. Each regisfer Xj is used fo simulafe fhe measuremenf af 
Sfep 2.2 while each regisfer Yj is used fo simulafe fhe measuremenf af Sfep 2.4, for each repetition round j of Sfep 2. 
More precisely, for each repetition round j of Step 2, fhe measuremenf of fhe qubif in in fhe compufafional 
basis af Sfep 2.2 is replaced by fhe application of a CNOT fransformafion fo Xj), while fhe measuremenfs 
of fhe qubifs in Q in fhe compufafional basis af Sfep 2.4 is replaced by fhe applications of CNOT fransformafions 
fo (Q, Yj). Eigure [12] summarizes fhe acfual consfrucfion of fhe OR-Type Repetition Procedure. 
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OR-Type Repetition Procedure — Simplified Description 

1. Prepare a /c-qubit register Q where all the qubits in Q are supposed to be initially in state |0). For each j in 

{1,..., k}, let denote the single-qubit quantum register corresponding to the jth qubit of Q. Prepare 
{w — A:)-qubit registers Rj, for j in ,N}, where all the qubits in Rj are supposed to be initially in the 

totally mixed state 1/2. Initialize a counter C to 0. 

2. For y = 1 to N, perform the following: 

2.1. Apply Q to (Q, Rj). 

2.2. Measure the qubit in in the computational basis. If this results in |0), increase the counter C by 
one. 

2.3. Apply to (Q, Rj). 

2.4. Measure each qubit in Q in the computational basis. If any of these results in 11), increase the counter C 
by one. 

3. Reject if C = 0, and accept otherwise. 


Figure 11: The OR-Type Repetition Procedure (a simplified description). 

Proposition 23. For any quantum circuit Q and any positive integer N, let be the quantum circuit correspond¬ 
ing to the OR-Type Repetition Procedure induced by Q and N. For the acceptance probability Pacc(Q) k) 
of the k-clean-qubit computation induced by Q and the acceptance probability pacc , k -|- [log N~\ 2) of the 

{k -|- [log N~\ -|- 2)-clean-qubit computation induced by R^^\ it holds that 

1 - (1 - Pacc(Q, k)f < p^cc{R^^\k + [log AT] + 2) < 1 - (1 - Pacc(Q, k)f^. 


Proof For ease of explanations, the proof analyzes the simplified version of fhe OR-Type Repetition Pro¬ 
cedure in Figure [TT] which is sufficienf for fhe claim, as fhe acfual consfrucfion of fhe OR-Type Repetition 
Procedure exacfly simulates fhe simplified version. 

Firsf notice fhaf fhe counfer value remains zero when entering Sfep 3 if and only if fhe simulation of Q resulfs 
in rejection (i.e., fhe measuremenf in Sfep 2.2 resulfs in |1)) and fhe initialization of fhe qubifs in Q succeeds 
(i.e., none of fhe measuremenfs in Sfep 2.4 resulfs in |1)) for all fhe N affempfs during Step 2. Suppose fhaf fhe 
qubifs in (Q, Rj) form fhe sfafe IV’r) = |0)®^ (8* |r) when entering Sfep 2.1, for some r in and lef pr be fhe 

probabilify defined by 

Pr = ||niQ|V'r)||^, 

where IIi is fhe projection operator acting over w qubifs defined by IIi = |1)(1| 0 Then, if is clear fhaf 

fhe measuremenf af Sfep 2.2 resulfs in |1) wifh probabilify exacfly pr- Hence, by telling Aq be fhe projection 
operator acting over w qubifs defined by Aq = (|0)(0|)®^ (8) the joinl probability that the measurement 

at Step 2.2 results in 11) and none of the measurements in Step 2.4 results in 11) is given by 

q, = ||AoQ^niQ|V’.)f. 

To prove the first inequality of the claim, notice that = ||AoQ^ni(5|V’r)|P is atmostp^ = ||niQ|'*/’r) |p- As 

the acceptance probability p^cciQ, k) of the /c-clean-qubit computation induced by Q is nothing but the expected 
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OR-Type Repetition Procedure 


1. Given a positive integer N, let I = [log N~\ + 1. Prepare a single-qubit register 0, an Z-qubit register C, and 
a fc-qubit register Q, where all the qubits in 0, C, and Q are supposed to be initially in state |0). For eaeh i 
in {1,, k}, let denote the single-qubit quantum register eorresponding to the ith qubit of Q. Prepare 
a {w — /c)-qubit register Rj, a single-qubit register Xj, and a fc-qubit register Vj, for eaeh j in {1,..., N}, 
where all the qubits in Rj, Xj, and Yj are supposed to be initially in the totally mixed state I /2. For eaeh i 
in {1,..., k} and j in {1,..., N}, let denote the single-qubit quantum register eorresponding to the ith 
qubit of Yj. 

2. For j = 1 to N, perform the following: 

2.1. Apply to (Q, Rj). 

2.2. Apply the CNOT transformation to Xj) with the qubit in being the eontrol. 

Apply [/+i(Z 2 i) to C if the eontent of is 0 (this ean be realized by first applying X to 
then applying the eontrolled-f7+i(Z2i) transformation A(f7+i(Z2i)) to C) with the qubit in 
being the eontrol, and further applying X to 

2.3. Apply qI to (Q, Rj). 

2.4. For eaeh i in {1,..., k}, apply the CNOT transformation to with the qubit in being 

the eontrol. 

Apply [7+1 (Z 2 i) to C if any of the qubits in Q eontains 1 (this ean be realized by first applying U+i (Z 2 i) 
to C, next applying X to eaeh of the qubits in Q, then applying the A:-eontrolled-f7+i(Z2i)^ transforma¬ 
tion A^(f7+i(Z2i)^) to (Q, C) with the qubits in Q being the eontrol, and further applying X to eaeh 
of the qubits in Q). 

3. Apply X to 0 if any of the qubits in C eontains 1 (this ean be realized by first applying X to eaeh of the 
qubits in 0 and C, and then applying the generalized Toffoli transformation A*(X) to (C, 0) with the qubit in 
0 being the target — and further applying X to eaeh of the qubits in C to preeisely realize the transformation 
in this step, whieh is unneeessary for the purpose of this proeedure). 

Measure the qubit in 0 in the eomputational basis. Aeeept if this results in |0), and rejeet otherwise. 


Figure 12: The OR-Type Repetition Procedure. 
value of 1 — Pj. over r in it holds that 

2-(—0 ^ (j, < 2-(—J] Pr = l-p.cc{Q,k), 

and thus, the probability is at most 1 — Pacc{Q,k) for the event that the eounter value remains zero after one 
iteration of Step 2, eonditioned that the eounter value was zero when starting that iteration. Overall, the probability 
that the eounter value remains zero when entering Step 3 is at most (1 — PscdQ, k))^, and thus, the proeedure 
results in aeeeptanee with probability at least 1 — (1 — Pacc(Q, k))^, whieh shows the first inequality. 

For the seeond inequality of the elaim, note that Aq = (|0)(0|)®^ (g) X]res(“-fc) {dr\, 

and thus, it holds that 

qr > \\\dr){A\Q^iiiQ\A)\f = \{A\Q^^iQ\A)\^ = ||niQ|V’.)||^ =p^ 
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Again using the fact that Pacc(Q) k) is nothing but the expected value of 1 — over r in it follows that 

^L-(u.-k) ^ p,y=(i-p,)y=(i-pacc(Q,fe))2, 

and thus, the probability is at least (1 — Pacc(<3, for the event that the counter value remains zero after one 
iteration of Step 2, conditioned that the counter value was zero when starting that iteration. Overall, the probability 
that the counter value remains zero when entering Step 3 is at least [(1 — PacciQ, k))'^]^ = (1 “ PacciQ, k))'^^. 
Accordingly, the procedure results in acceptance with probability at most 1 — (1 — Pacc(Q) k))^^, and the second 
inequality follows. □ 

6.2 Proofs of Theorems 3] and |5] 

Now we are ready to prove Theorems |4] and [5] First, we prove two lemmas. 

Lemma 24. For any constants c and s in M satisfying 0 < s < c < 1, 

QiogP(c,5)CQ[i]P(|, 


Proof. For any constants c and s in M satisfying 0 < s < c < 1, let A = (Ayes, Ano) be a problem in QiogP(c, s). 
Then A has a polynomial-time uniformly generated family {Qxjxes* of quantum circuits such that, for every 
input X, PucciQx, ^i(kl)) > c if X is in Ayes and Pacc(Qx, ^i(kl)) < s if x is in Ano for some logarithmically 
bounded function ki: N. The proof constructs a polynomial-time uniformly generated family of quantum 

circuits that puts A in Q[i]P(|, |). 

Fix an input x. 

From the circuit Qx, one first constructs a quantum circuit Rx that runs N attempts of Qx in parallel, and 
accepts if and only if at least fraction of the N attempts results in acceptance. Notice that this can be easily 
implementable if is a power of two, by combining the increment transformation with the threshold-check trans¬ 
formation discussed in Subsection 13.31 By taking a sufficiently large constant N that is a power of two, it holds 
that pa.cc{Rx, k 2 {\x\)) > ^ if X is in Ayes, and Pm:c{Rx, k 2 {\x\)) < if x is in Ano, where /c 2 : N is the 

logarithmically bounded function defined by /c 2 = Nki + log A -|- 1. 

From fhe circuif Rx, one consfrucfs aquanfum circuif according fo fhe One-Clean-Qubit Simulation 
Procedure wifh k = k 2 . By ProposifionfTTl if holds lhalpacc(f?^, 1) > 1 — 256 g| | x | ) > ^ if x is in Ayes, 

and pacciR'x^ 1) < 1 — if x is in Ano, where q: N is fhe polynomially bounded funcfion defined by 

q = 2 ^ 2 . 

Finally, from fhe circuif R'^, one consfrucfs a quanfum circuif R" according fo fhe RANDOMNESS AMPLIFI¬ 
CATION Procedure wifh fhe infeger A = 2 q(|x|). By Proposition (TH if holds fhaf 


1 1 


PacciR'xJ 1) > T: + n ( 1 “ 


2 2 


4g(|x|) 


2g(N) ^ ^ 

> - + - l-2q(|x|) 


2 2 


4q(|x|) 


3 

4 


if X is in Ayes, and 


Pacc(i?x,l) < 2 + 2 (^ 8g(|x|) 


15 


29 (| 3 ;|) 2 

^ 2^ 2l^“ 8q(|x|) 


t!9(N) 1 1 5 

^ 2 ^ ^ 8 


if X is in Ano- 

The claim follows wifh fhe polynomial-lime uniformly generafed family {R”}xej:* of quanfum circuils. 


□ 
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Lemma 25. For any constants c and s in M satisfying 0 < s < c < 1 and for any polynomially bounded func¬ 
tion p: —>■ N, 


Q[i]P(c,s) CQi„gP(l-2-^ i). 


Proof For any constants c and s in M satisfying 0 < s < c < 1, let A = (Ayes, ^no) be a problem in Q[i]P(c, s). 
Then A has a polynomial-time uniformly generated family {Qx}x&t.* of quantum circuits such that, for every 
input X, Pa.cc{Qx, 1) > c if X is in Ayes and Pacc(Qx, 1) < s if x is in Ano- For any polynomially bounded func¬ 
tion p: N, the proof constructs a polynomial-time uniformly generated family of quantum circuits that puts 

Ain QiogP(l-2-P,i). 

Fix an input x. 

From the circuit Qx, one first constructs a quantum circuit Rx that runs /(|x|) attempts of Qx in parallel, 
and accepts if and only if at least ^ fraction of the ^(|x|) attempts results in acceptance, where /: Z+ —N is 
some logarithmically bounded function such that Hn) is a power of two for every n in Z+. Again notice that 
this can be easily implementable by combining the increment transformation with the threshold-check transfor¬ 
mation discussed in Subsection 13.31 By taking such a function I that is at least , ^ ^ (logp -f 2), it holds that 

(C S) 

P^cciRx, k{\x\)) > 1 - 4 ^^ if X is in Ayes, and Pi,cc{Rx, ki\x\)) < if x is in Ano, where fc: Z+ N is 

the logarithmically bounded function defined by k = I log I -b 1- 

From the circuit Rx, one further constructs a quantum circuit R'^ according to the OR-Type Repetition 
Procedure with the integer N = p(|x|). By Proposition [23l it holds that 


p^cc{Rx,ki\x\) + [logp(|x|)] -h2) > 1 - 


4p(|x|) 


p(|a;|) 


> 


1 _ 2“P(I*I) 


if X is in Ayes, and 


P>,cc{Rx,k{\x\) + [logp(|x|)] -h2) < 1 - - 


1 


4p(|x|) 


2p{\x\) 


< 1 - 1 - 2p(|x| 


1 


4p(|x|) 


1 

2 


if X is in Ano- 

The claim follows with the polynomial-time uniformly generated family {R'x}xeE* of quantum circuits. □ 


6.2.1 One-Clean-Qubit Case 

By using Lemmas]^ and |25l Theorem[5]is proved as follows. 

Proof of Theorem^ For any constants c and s in M satisfying 0 < s < c < 1, let A = (Ayes, ^no) be a problem in 
QiogP(c, s). It is sufficient to show that the containment QiogP (c, s) C Q[4]P(1 — 2 f) holds for any polyno¬ 
mially bounded function p: —>• N. Indeed, the containment QiogP(c, s) C Q[i]P(l — |,2 P)is then proved by 

considering the complement problem A = (Ano, Ayes) of A: As A is in QiogP(c, 5) for the constants s = 1 — s and 
6 = 1 — c satisfying 0 < 5 < e < 1, the containment above to be proved implies that A is in Q[i]P(l — 2~p, |), 
and thus, that A is in Qq]P(l — f, 2~P'j , for any polynomially bounded function p: Z+ —>• N. 

From Lemma [24l A is in Q[i]P(|,|). Therefore, from Lemma |25l A is in QiogP(l — 2“'?, f), 
where q: Z+ —)■ N is the polynomially bounded function defined by q = p-|-4[logp] -|- 13. Hence, A has 
a polynomial-fime uniformly generafed family {Qxjxes* of quanfum circuifs such fhaf, for every inpuf x, 
PacciQx, k{\x\)) > 1 — 2 “ 9(1^1) if X is in Ayes and Pa,cc{Qx, ^(kl)) < ^ if X is in Ano, for some logarifhmically 
bounded function k: —)■ N. The resf of fhe proof is essentially fhe same as fhe proof of Theorem[2]in Secfion[5l 

Fix an input x. 
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From the circuit Q^, one first constructs a quantum circuit R^. according to the One-Clean-Qubit Simula¬ 
tion Procedure. By Proposition [TTl it holds that 




1 - (l - 


> 1 — = 1 — 


2-g(|a;|)+2 

ri(|x|) 


if X is in ^yes, and 

1 - ^ = 1-2- < p^,{R^, 1) < 1 - 2- = 1 - —^ 

if X is in ^no^ where ri: Z"*" —)• N is the polynomially bounded function defined by ri = 2^^^. 

From the circuit R^, one constructs a quantum circuit R'^ according to the RANDOMNESS Amplification 
Procedure with the integer N = ri(|x|) r 2 (|x|) for a polynomially bounded function r 2 : Z+ —N such that 
T 2 = p + 1- By Proposition [TH it holds that 


2-g(|a:|)+3\ ?'i(|3:|)?’2(|x|) 

^l(kl) ) 

> 2 + 2 ( ‘ “ n(|l|) r 2 {|x|) . j = 1 - >- 2 ( 1 x 1 ) ■ 2- 

if X is in Ayes, while 


Pacc(-Rx) — 2 2 ( 


1 1 


— ^ Pacc(-Rx) ^) — o o ( ^ ~ 


ri(|x| 


ri{\x\)r 2 {\x\) 


< i } lp -‘^ i '2{\ x \) , 
2^2 


1 _|_ 2-2r’2(|3;|) 


if X is in Ano- 

Finally, from the circuit R'^, one constructs a quantum circuit i?" according to the One-Clean-Qubit Sta¬ 
bility Checking Procedure with N to be the smallest integer that is a power of two and at least (6p(|x|))^ 
(i.e., N = > 2® = 64). If x is in Ayes, Proposition [19] ensures that 


Pacc(>R",l)>fl-r2(|T|)-2-'^(N)+2 


4-(6p(|x|))3-l 


> 1 - r2{\x\ 


^-q{\x\)+2 


2-(8p(|x|))3 


> 1 - 2 • (8p(|x|))3 • r 2 {\x\) • 2 -''(I^I )+2 > 1 - 2-P(l^l), 


where the first inequality follows from the fact that N is at most 2 • (6p(|x|))^, the second in¬ 
equality follows from the fact that 2 • (6p(|x|))^ < (8p(|x|))^, and the last inequality follows from 
the fact that q(|x|) = p(|x|)-|-dflog^dx])] -|-13, that (8p(|x|))^ = that 

T 2 (|t|) = p(|x|)-|-1 = and that the inequality log(n-|-1) < logn-|-1 holds for any n > 1. 

On the other hand, if x is in Ano, Proposition l20l ensures that 


Pacc(^", 1) < 


1 

2p(|x|) 


_|- 4.2-2>’2(|a:|) 


1 

p(|x|)’ 


where the last inequality follows from the facts that r 2 > p + I and that the inequality 2"^ > n holds. 

The claim follows with the polynomial-time uniformly generated family {R”}x^e* of quantum circuits. □ 
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6.2.2 Two-Clean-Qubit Case 

Again by using Lemmas |24] and |25l Theorem|4]is proved as follows. 


Proof of Theorem^ The proof is essentially the same as that of Theorem^ except that the circuit i?" is constructed 
according to the Two-Clean-Qubit Stability Checking Procedure rather than the One-Clean-Qubit 
Stability Checking Procedure. 

For any constants c and s in M satisfying 0 < s < c < 1, let A = (Ayes, Ano) be a problem in QiogP(c, s). 
From Lemma [24l A is in Q[i]P(|, I). Therefore, from Lemma 1^ A is in QiogP(l — 2 where q: TL'^ N 

is the polynomially bounded function defined by q = p + [logp] -|- 12. Hence, A has a polynomial-time uniformly 
generated family {(5x}a;es* of quantum circuits such that, for every input x, PacdQx, k{\x\)) > 1 — if x 

is in Ayes and PacdQx, A:(|t|)) < ^ if x is in Ano, for some logarithmically bounded function k: Zf' N. The 
rest of the proof is essentially the same as the proof of Theorem [T] in Section |5] 

Fix an input x. 

From the circuit Qx, one first constructs a quantum circuit Rx according to the One-Clean-Qubit Simula¬ 
tion Procedure. As in the proof of TheoremUl by Proposition [TTl it holds that 

2 -q{\x \)+2 

Pacd^xi 1) ^ 1 


if X is in Ayes, and 


1 - 


r(|x|) 


^ Pacd^Rx j 1 ) ^ 1 


1 


r(|x|) ’ r(|x|) 

if X is in Ano, where r : —)■ N is the polynomially bounded function defined by r = 2^+^. 

From the circuit Rx, one constructs a quantum circuit R'x according to the RANDOMNESS Amplieication 
Procedure with the integer N > a t(|x|), where a = | In 2 < 1.04. By Proposition [TSl it holds that 

2-g(|3;|)-l-3\ ar(\x\) 


Pacd^xi — 2 2 


1 - 


1 1 
^2 + 2 


1 — ar(|x|) • 


if X is in Ayes, while 
1 


2 — P&cd^xi — 2 2 


r(|x|) 
2-9(I*I)+3 
r(|x|) 


a. p(|fc|) 


= l-a-2 


-9(kl)+2 


1 - 


r(|x|) 


1 1 

< —I—e 
2 2 


-3 In 2 


“2 + 


1 

16 


if X is in Ano- 

Finally, from the circuit R^, one constructs a quantum circuit i?" according to the Two-Clean-Qubit Sta¬ 
bility Checking Procedure with N to be the smallest integer that is apower of two and at least 16(p(|x|) -|-1) 
(i.e., N = If X is in Ayes, Proposition |2T] ensures that 

Pacc(f2",2) > (1> l- 256(p(|x|) + l)-a-2-‘'(l^l)+2 > i_2-P(PI), 


where the first inequality follows from the fact that N is at most 2 • 16(p(|x|) -|- 1) = 32(p(|x|) -|- 1), and the 
last inequality follows from the fact that q(|x|) =p(|x|) -|- [logp(|x|)] + 12, that a = |ln2 < 1.04 < 2, that 
p{\x\) -|- 1 = 2*°s(p(l^l)+i), and that the inequality log(n -|- 1) < log n -|- 1 holds for any n > 1. 

On the other hand, if x is in Ano, Proposition |22] ensures that 

Pacc(i?",2) < 2-^’(l"l) 


(notice that the bounds for the probability pacd^d 1) above are such that Proposition |22] can be used to show the 
upper bound for the probability pacd^d 2))- 

The claim follows with the polynomial-time uniformly generated family {i?"}a;gs* of quantum circuits. □ 
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7 Completeness Results for Trace Estimation Problem 


This section proves Theorem^ which states that the Trace Estimation problem TREST(a, b) with any param¬ 
eters a and b satisfying 0<5<a<lis complete for both BQ^^gP and under polynomial-time many-one 

reduction. 

As presented in Ref. HSJOSlI . the following two properties are known to hold. 

Lemma 26. For any constants a and b in M satisfying —1 < 6 < a < 1, TREST(a, b) is in Q[i]P (5 + §) i + |)- 

Lemma 27. For any constants a and b in R satisfying 0 < b < a < 1, TREST(a, b) is hard for Q[]^]P(|, under 
polynomial-time many-one reduction. 

Theoreml^is then easily proved as follows, by combining Theorem[5]and these two lemmas. 

Proof of Theorem^ We show the BQ[^]P-completeness of TREST(a, b). The BQjQgP-completeness is then triv¬ 
ial, as BQiQgP = BQ[^]P due to Theorem |5] In what follows, fix the parameters a and 6, 0 < 6 < o < 1, of 
TREST(a, b). 

Eor the membership that TREST(a, b) is in BQjj^jP, notice that the class Q[i]P(^ f ’ I t) included in 
the class Q[i]P(c, s) for any constants c and s satisfying that c > max||, ^ + f} and s < min{^, ^ + |}^ due 
to Theorem |5l Therefore, the class Q[i]P(^ + f > ^ + I) included in BQj^jP also. Hence, the membership is 
immediate, as Lemma licensures that TREST(a, b) is in Q[i]P (5 + f > 5 + |)- 

Now for the BQj^jP-hardness of TREST(a, b), notice that BQjj^jP is included in the class Q[^]P(c, s) for any 
constants c and s satisfying that c > max{|, || and s < minji, ^|, due to Theorem [5] Therefore, BQjj^jP is 
included in the class Q[]^]P(|, also. Prom Eemmal27l TREST(a, b) is hard for Q[i]P(|, under polynomial¬ 
time many-one reduction, and thus, TREST(a, b) is hard for BQj^jP also, under polynomial-time many-one reduc¬ 
tion. □ 

Remark. As mentioned in Subsection l3.51 the completeness results of Theorem[Chold even under logarithmic-space 
many-one reduction, if the classes BQj^gP and BQjj^jP are defined with logarithmic-space uniformly generated 
family of quantum circuits. 


8 Hardness of Weak Classical Simulations of DQCl Computation 

This section deals with the hardness of weakly simulating a DQCl computation. Pirst, Subsection 18. ll reviews the 
notions of weak simulatability that are discussed in this paper. Subsection l8.2l then proves Theorems |7] and [8] 


8.1 Weak Simulatability 


Pollowing conventions, this paper uses the following notions of simulatability. 

Consider any family {Qxjxes* of quantum circuits. Eor each circuit Qx, suppose that m output qubits are 
measured in the computational basis after the application of Qx to a certain prescribed initial state (which will be 
clear from the context). Pet Px: S™ —5- [0,1] be the probability distribution derived from the output of Qx (i.e., 
Px{yi,..., Um) is the probability of obtaining the measurement result (yi,..., y^) in 5]™ when Qx is applied to 
the prescribed initial state). 

The family {Qx}xeY:* is weakly simulatable with multiplicative error c > 1 if there exists a family {Pf]x^T,* 
of probability distributions that can be sampled classically in polynomial time such that, for any x in S* and any 

( 2 / 1 , •••,2/m) in TT, 


Pxiui , • • • J ym) ^ , • • • J Um) ^ C Pxivi , • • • , ym)- 

C 


( 2 ) 
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Similarly, the family {Qx}x&t,* is weakly simulatable with exponentially small additive error if, for any poly- 
nomially bounded function q, there exists a family of probability distributions that can be sampled 

classically in polynomial time such that, for any x in S* and any (yi,..., ym) in 

\Px{yi, Kiyu ■ ■ ■ ,2/m)| < 

A few remarks are in order regarding the notions of weak simulatablity above. 

First, the notion of weak simulatablity with multiplicative error was first defined in Ref. IITD041I in a slightly 
different form. The definition taken in this paper is found in Refs. llBJSllllMFF14l . for instance. The version in 
Ref. IITD04I uses the bound \Px{yi,.. ■ ,ym) - P'xiyi^ ■ ■ ■ ^ym)\ < £ Px{yi, ■ ■ ■ ,ym) instead of the bounds dH), 
and these two versions are essentially equivalent. The results in this paper hold for any e in [0,1) when using the 
version in Ref. IITD041 . 

The notion of weak simulatability with exponentially small additive error was introduced in Ref. IITYT14L and 
was used also in Ref. IITTYT15I . 

As the notion of weak simulatablity with multiplicative error is often used when discussing the classical simu¬ 
latability of quantum models llTD041^JSllllAA13[lNVdN131lJVdN141lMFF141lBrol5l . the hardness result on the 
DQCl model under this notion certainly makes it possible to discuss the power of the DQCl model along the line 
of these existing studies. As discussed in Refs. IIBJS111IAA13I . however, a much more reasonable notion is the 
weak simulatability with polynomially small additive error in total variation distance. Proving or disproving clas¬ 
sical simulatability under this notion is one of the most important open problems in most of quantum computation 
models including the DQCl model. 


8.2 Proofs of Theorems |7] and |8] 

First, Theorem|7j stating that the restriction to the DQCl computation does not change the complexity classes NQP 
and SBQP, can be easily proved by using the One-Clean-Qubit Simulation Procedure presented in Sub¬ 
section O 


Proof of Theorem^ It suffices to show that co-NQP C co-NQ[^]P and co-SBQP C co-SBQ[^]P. 

We first show that co-NQP C co-NQjj^jP. 

Consider any problem A = (Ayes, Ano) is in co-NQP, and let {Qxyx&T,* be a polynomial-time uniformly gen¬ 
erated family of quantum circuits that witnesses this fact. For each x in S*, the circuit Qx acts over u)(|x|) qubits, 
for some polynomially bounded function w: IP —s- N. By the definition of NQP, for every input x, the acceptance 
probability Pa.cc{Qxi of the circuit Qx is one if x is in Ayes, while it is less than one if x is in Ano- 

Fix an input x, and consider the quantum circuit Rx corresponding to the One-Clean-Qubit Simulation 
Procedure induced by Qx and m(|x|). By the properties of Qx, Proposition [TT] ensures that the acceptance 
probability Pa,cc{Rx, 1) of the one-clean-qubit computation induced by Rx is one if x is in Ayes, while it is less than 
one if X is in Ano, which implies that A is in co-NQ[^]P. 

Now we show that co-SBQP C co-SBQ[^]P. 

Consider any problem A = (Ayes, Ano) is in co-SBQP. Then, by the amplification property of SBQP pre¬ 
sented in Ref. | Kupl5[ , there exists a polynomial-time uniformly generated family {Qxjxes* of quantum circuits 
such that, for every input x, Qx acts over u)(|x|) qubits, for some polynomially bounded function w: IP —)■ N, and 
the acceptance probability Pa,cc{Qx, w^(|t|)) of the circuit Qx is at least 1 — 2~PPP‘^ if x is in Ayes, while it is at 
most 1 — 2~PP if X is in Ano, for some polynomially bounded function p: IP N. 

Fix an input x, and consider the quantum circuit Rx corresponding to the One-Clean-Qubit Simulation 
Procedure induced by Qx and m(|x|). By the properties of Qx, Proposition [TT] ensures that the acceptance 
probability Pa,cc{Rx, 1) of the one-clean-qubit computation induced by Rx is at least 


1 _ 


1 - (l - 2-PPP^ f 


> 1 — = 1 — 2“"’(l*l)-p(|a;|)-i^ 
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if X is in ^yes, while it is at most 

1 _ 2“"'(l^l) 1^1 — (^1 — = 1 — 

if X is in which ensures that A is in co-SBQjj^jP. □ 

In fact, the argument used to show that SBQP = SBQj^jP in the proof of Theorem|7]can be extended to prove 
the following amplification property of SBQj^jP, which is analogous to the cases of SBQP and SBP. 

Theorem 28. For any polynomially bounded function p: 'lA —>• N, there exists a polynomially bounded func¬ 
tion q: N such that 

SBQ[pP C Q[i]P(2-'? • (1 - 2-Q, 2-'' • 2-^). 


Proof It suffices fo show fhaf for any polynomially bounded funcfion p : lA N, fhere exisfs a polynomially 
bounded funcfion q: lA N such fhaf co-SBQP C Q[^]P(l — 2“'^ • 2“^, 1 — 2“'^ • (1 — 2“^)) holds, due fo 
Theorem |7] 

Fix a polynomially bounded funcfion p: lA —)• N, and consider any problem A = (Ayes, j4no) is in co-SBQP. 
By fhe amplificafion properfy of SBQP presenfed in Ref. [KuplS |, fhere exisfs a polynomial-fime uniformly gen- 
erafed family {Qx}xe'E* of quanfum circuifs such fhaf, for every inpuf x, Qx acfs over m(|x|) qubifs, for some 
polynomially bounded funcfion w: lA —)• N, and fhe accepfance probabilify Pacc{Qx,w{\x\)) of fhe circuif Qx is 
af leasf 1 — 2“’' • 2“^“^ if x is in Ayes, while if is af mosf 1 — 2“^' • (1 — 2~p~^) if x is in Ano, for some polyno¬ 
mially bounded function r : Z+ ^ N. 

Fix an inpuf x, and consider fhe quanfum circuif Rx corresponding fo fhe One-Clean-Qubit Simulation 
Procedure induced by Qx and t(;(|x|). By fhe properties of Qx, Proposition [TtI ensures fhaf fhe accepfance 
probabilify Pacc{Rx, 1) of fhe one-clean-qubif compufafion induced by Rx is af leasf 

1 _ 2“"'(l®l) 1 — (i — 

> 1 — — (l — = 1 — • 2“ 


if X is in Ayes, while if is af mosf 


1 - 2 


-w)(|a;|) 


1 - 1-2 


-r{\x\) 


( 1-2 


-p(|a;|)-l 


= 1 — • (l — < 1 — • (l — 

if X is in Ano, which implies fhaf A is in Qji]P(l — 2“'? • 2~p, 1 — 2“'? • (1 — 2“^)) for g = tu -|- r, and fhe claim 
follows. □ 


Now we are ready fo prove Theorem |8] More formally, we prove fhe following sfafemenf. 

Theorem 29. Suppose that any polynomial-time uniformly generated family of quantum circuits, when used in 
DQCl computations, is weakly simulatable with multiplicative error c> 1 or exponentially small additive error. 
Then PH = AM. 

Theorem [^follows direcfly from Lemmas |30l |3ll and[32]below, combined wifh fhe facf fhaf AM C PH. 

Lemma 30. Suppose that any polynomial-time uniformly generated family of quantum circuits, when used in 
DQCl computations, is weakly simulatable with multiplicative error c > 1 (resp., exponentially small additive 
error). Then NQP C NP (resp., NQP C SBPj. 
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Proof. Fix any problem A = (Ayes, Ano) in NQP. By Theorem|7j there exists a polynomial-time uniformly gen¬ 
erated family {Rx}x&i:* of quantum eireuits sueh that, for every x G S*, the acceptance probability Pa,cc{Rx, 1) 
of the DQCl computation induced by Rx is nonzero if x is in Ayes, while it is zero if x is in Ano- From the 
assumption of this lemma, there exists a polynomial-time uniformly generated family {R'^}xe-E* of randomized 
circuits that weakly simulates {-Rxjxes* with multiplicative error c > 1. Now the definition of weak simulatability 
with multiplicative error ensures that the probability that the circuit R'^ outputs 0 (corresponding to acceptance) is 
nonzero if and only if Pg^^^.{Rx, 1) is nonzero, which happens only when x is in Ayes- This implies that A is in NP. 

In the case where {R'^}x;eE* weakly simulates {Rx}xeY:* with exponentially small additive error, the proof uses 
the fact that the membership of A in NQP is witnessed by a polynomial-time uniformly generated family {Qx}x&'E* 
of quantum circuits, where each Qx is composed only of Hadamard, Toffoli, and NOT gates (more precisely, each 
Qx is composed only of Hadamard, T, and CNOT gates, satisfying the definition of NQP in the present paper, but 
may be assumed to be composed in such a way that T and CNOT gates are used only for applying Toffoli and NOT 
transformations). Notice that, for such Qx, the acceptance probability, if it is nonzero, must be at least for 

some polynomially bounded function p: Z+ —)• N. Hence, from Theorem|7]and its proof, the family {Rx}xei:* of 
quantum circuits that witnesses the membership of A in NQj^jP = NQP may be assumed to be such that, for each 
Rx, the acceptance probability PacciRx, 1), if h is nonzero, must be at least 2~ for some polynomially bounded 
function q: —)• N. Furthermore, from the definition of weak simulatability with exponentially small additive 

error, the family {R'xjxeT.* may be assumed to simulate {f?a,}xes* with additive error at most This 

implies that the probability that R'x outputs 0 is at least 3 • 2~ jf x is in Ayes, while it is at most 2~ 

if X is in Ano, which ensures that A is in SBP. □ 

Lemma 31. //NQP C NP, then PH C AM. 

Lemma 32. //NQP C SBP, then PH C AM. 

The proof of Lemma [3l] requires the notion of the BP operator, whereas the proof of Lemma [32] requires the 
notion of the BP operator, a variant of the BP operator introduced in Ref. IIT0921I . 

For any complexity class C of promise problems, a promise problem A = (Ayes, Ano) is in BP • C iff there exist 
a promise problem B = (Byes, Bno) in C and a polynomially bounded function r: N such that, for every x 

in S*, it holds that 

X G Ayes ^ G : (x, z) G ^ • 2^(1"') and 

xGAno ^|{zGS^(N):(x,QG//no}| 

Similarly, for any complexity class C of promise problems, a promise problem A = (Ayes, Ano) is in BP • C 
iff for any polynomially bounded function q: —>• N, there exist a promise problem B = (//yes, -Bno) in C and a 

polynomially bounded function r: TLA —)■ N such that, for every x in S*, it holds that 

X G Ayes {x,z) G Byes}| > (1 - 2-''(l^l)) • and 

xGAno ^ |{z G (x,Q G Bno}| > (l - 2"• 2''(l^l). 

It is easy to see that AM = BP • NP. By the standard error reduction of AM, one can also see that 
AM = BP • NP = I^ • NP. 

Now Lemma l3T] is proved as follows. 

Proof of Lemma\^ The claim follows from the following sequence of containments: 

PH C BP • co-C=P = BP • NQP C BP • NP = AM, 
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where the first inclusion is by Corollary 2.5 of Ref. 11X0921 or Corollary 5.2 of Ref. IITar93l . and we have used the 
fact that NQP = co-C=P 1FGHP991 . the assumption NQP C NP of this lemma, and the fact that AM = BP • NP, 
respectively. □ 

Similarly, Lemma [3^ is proved as follows. 

Proof of Lemma\^ The claim follows from the following sequence of containments: 

PH C . co-C=P = • NQP C . SBP C . AM = • NP = 1^ • NP = AM, 

where the first inclusion is by Corollary 2.5 of Ref. IITQ921I . the next equality uses the fact that 
NQP = co-C=P IIFGHP991I . the next two inclusions are by the assumption NQP C SBP of this lemma and by 
the fact that SBP C AM IIBGM06L and the last three equalities use Ae characterization of AM by BP • NP as 
well as Lemma 2.8 of Ref. IIT0921 on the removability of a duplicate BP operator. □ 

Remark. If any polynomial-time uniformly generated family of quantum circuits, when used in DQCl computa¬ 
tions, is weakly simulatable either with multiplicative error c > 1 or with exponentially small additive error, then 
the inclusion SBQP C SBP can also be proved, by using the fact that SBQP = SBQjj^jP proved in Theorem |7] 
as well as the amplification property of SBQP (or the amplification property of SBQj^jP proved in Theorem [2^. 
This can also prove Theorem [8] (more formally. Theorem l29l) . the hardness result on such weak simulatability of 
DQCl computations, as the collapse of PH to AM also follows from the assumption SBQP C SBP, due to the 
inclusion NQP C SBQP. 

Finally, the argument based on NQP and SBQP used to prove Theorem [8] (more formally. Theorem l29l) in this 
section can also be used to show the hardness of weak classical simulations of other quantum computing models. In 
particular, it can replace the existing argument based on PostBQP, which was developed in Ref. HBJSlllI and has 
appeared frequently in the literature IIAA131 |JVdN141 IMFF141ITYT141 ^rol51 ITTYT15I . This also weakens the 
complexity assumption necessary to prove the hardness results for such models, including the IQP model IIBJSIII 
and the Boson sampling IIAA131I (the polynomial-time hierarchy now collapses to the second level, rather than 
the third level when using PostBQP). Moreover, the hardness results for such models now hold for any constant 
multiplicative error c > 1, rather than only for c satisfying 1 < c < V2 as in Refs. IIB.LS111IMFFT41 . 

9 Conclusion 

This paper has developed several error-reduction methods for quantum computation with few clean qubits, which 
simultaneously reduce the number of necessary clean qubits to just one or two. Using such possibilities of error- 
reduction, this paper has shown that the Trace Estimation problem is complete for BQj^gP and BQj^jP. One of 
the technical tools has also been used to show the hardness of weak classical simulations of DQCl computations. 
A few open problems are listed below concerning the power of quantum computation with few clean qubits: 

• In the case of one-sided bounded error, can any quantum computation with logarithmically many clean qubits 
be made to have exponentially small one-sided error by just using one clean qubit, rather than two? A similar 
question may be asked even in the case of two-sided bounded error whether both completeness and soundness 
errors can be made exponentially small simultaneously by just using one clean qubit. 

• In the two-sided error case, is error-reduction possible even when a starting quantum computation with few 
clean qubits has only an inverse-polynomial gap between completeness and soundness? 

• Are DQCl computations provable to be hard to classically simulate under a more desirable notion of simu¬ 
latability, like those discussed in Refs. llBJSllllAA13ll ? Such results are known for other computation models 
like Boson sampling by assuming hardness of some computational problems IIAA1311BMS151IFU15I . 
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